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GENES FOR ASSESSING CARDIOVASCULAR STATUS 
AND COMPOSITIONS FOR USE THEREOF 

FIELD OF THE INVENTION 

5 The present invention relates to genetic polymorphisms and polymorphism 

patterns useful for assessing cardiovascular status in humans. More particularly, the 
invention relates to identifying and using polymorphism patterns comprising a 
polymorphism in the a gene encoding a polypeptide selected from the group consisting of 
ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 
1 0 receptor, and P-adrenoceptor to predict a treatment outcome or likelihood of developing 
cardiovascular disease, and to assist in diagnosis and in prescription of an effective 
therapeutic regimen. 

BACKGROUND OF THE INVENTION 

The renin-angiotensin-aldosterone system (RAAS) plays an important role 
in cardiovascular physiology in mammals. Specifically, RAAS regulates salt-water 
homeostasis and the maintenance of vascular tone. Stimulation or inhibition of this 
system raises or lowers blood pressure, respectively, and disturbances in this system may 
be involved in the etiology of, for example, hypertension, stroke, and myocardial 
infarction. The RAAS system may also have other functions such as, e.g., control of cell 
growth. The renin-angiotensin system includes renin, angiotensin converting enzyme 
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(ACE), angiotensinogen (ACT), type 1 angiotensin II receptor (ATI), type 2 angiotensin II 
receptor (AT2) and aldosterone synthase. 

International Patent Application No. PCT/IB98/00475, filed April 1, 1998, 
discloses for the first time an association of polymorphism patterns in ACE, AGT, and 
5 ATI genes with cardiovascular status, particularly with the ability to predict the 
therapeutic outcome of a particular treatment regimen. 

Other genes express polypeptides involved in pathways other than RAAS 
that also play a role in the regulation of cardiovascular physiology, although it is believed 
that prior to the present invention, no known association of polymorphism patterns in 
1 o these genes with cardiovascular status has been observed or reported. Among these other 
regulators of cardiovascular physiology are endothelin, endothelin receptor, and 

P-adrenergic receptors 1 and 2. 

RAAS Pathway Co mponents 
AGT is the specific substrate of renin, an aspartyl protease. The human 
15 AGT gene contains five exons and four introns which span 13Kb (Gaillard et al, DNA, 
1989, 8:87-99; Fukam.zu et al, J. Biol. Chem., 1990, 265:7576-7582). The first exon (37 
bp) codes for the 5' untranslated region of the mRNA. The second exon codes for the 
signal peptide and the first 252 amino acids of the mature protein. Exons 3 and 4 are 
shorter and code for 90 and 48 amino acids, respectively. Exon 5 contains a short coding 
2 0 sequence (62 amino acids) and the 3*-untranslated region. 

Plasma AGT is synthesized primarily in the liver and its expression is 
positively regulated by estrogens, glucocorticoids, thyroid hormones, and angiotensin II 
(Ang II) (Clauser et al, Am. J. Hypertension, 1989, 2:403-410). Cleavage of the amino- 
terminal segment of AGT by renin releases a decapeptide prohormone, angiotensin-I, 

2 5 which is further processed to the active octapeptide angiotensin II by the dipeptidyl 

carboxypeptidase designated angiotensin-converting enzyme (ACE). Cleavage of AGT by 
renin is the rate-limiting step in the activation of the renin-angiotensin system. 

Several epidemiological observations indicate a possible role of AGT in 
blood pressure regulation. A highly significant correlation between plasma AGT 

3 0 concentration and blood pressure has been observed in epidemiological studies (Walker et 

al, J. Hypertension, 1979, 1 :287-291). Interestingly, a number of allelic dimorphisms 
have been identified in the AGT gene. The frequency of at least two of them (174M and 
235T) have been partially characterized and in certain populations shown to be 



BNSDOCID: <WO 00221 66A2_I_> 



WO 00/22166 _ 3 _ PCT/1B99/01678 

significantly elevated in hypertensive subjects (Jeunemaitre et aL, Cell, 1992, 71:169- 
180). In addition, a specific polymorphism, 235T, has been suggested to be directly 
involved in coronary atherosclerosis (Ishigami et aL, Circulation, 1995, 91:951-4). 
Futhermore, the presence of A or G at position 1218 in the AGT regulatory region has 
5 been correlated with differences in in vitro transcriptional capacity for this gene (Inoue et. 
ah, J. Clin. Invest., 1997, 99:1786). However, the foregoing are studies involving only 
one or at most two polymorphisms. Furthermore, the sole disclosed use is susceptibility to 
disease. 

The human ACE gene is also a candidate as a marker for hypertension and 
10 myocardial infarction. ACE inhibitors constitute an important and effective therapeutic 

approach in the control of human hypertension (Sassaho et aL, Am. J. Med., 1987, 83:227- 

235). In plasma and on the surface of endothelial cells, ACE converts the inactive 

angiotensin I molecule (Ang I) into active angiotensin II (Ang II) (Bottari et aL, Front. 

Neuroendocrinology, 1993, 14:123-171). Another ACE substrate is bradykinin, a potent 
15 vasodilator and inhibitor of smooth muscle cell proliferation, which is inactivated by ACE 

(Ehlers et aL, Biochemistry, 1989, 28:531 1-5318; Erdos, E.G., Hypertension, 1990, 

16:363-370; Johnston, C.I. Drugs (suppl. 1), 1990,39:21-31). 

Levels of ACE are very stable within individuals, but differ greatly between 

individuals. A greater risk of myocardial infarction has been identified in a group of 

2 0 subjects with an ACE polymorphism designated ACE-DD (Cambien et aL, Nature, 1992, 

359:641-644), and a 12-fold greater risk of myocardial infarction has been identified in a 
subgroup of patients having a combination of the ACE polymorphism ACE-DD and one 
of the AGT polymorphisms (235T) described above (Kamitani et aL, Hypertension, 1994, 
24:381). Recently, six ACE polymorphisms were identified and characterized (Villard et 
25 aL, Am. J. Human Genet., 1996, 58:1268-1278). 

The vasoconstrictive, cell growth-promoting and salt conserving actions of 
angiotensin II are mediated through binding to and activation of angiotensin receptors, of 
which at least two types have been cloned (ATI and AT2). The type 1 Ang II receptor 
(ATI), a G-protein-coupled seven transmembrane domain protein, is widely distributed in 

3 0 the body and mediates almost all known Ang II effects (Fyhrquist et aL, J. Hum. 

Hypertension, 1995,5:519-524). 

Several polymorphisms have been identified in the ATI receptor gene. 
Initial studies suggest that at least one of them is more frequent in hypertensive subjects 
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(AT 1,6b C)(Bonnardeaux et al, Hypertension, 1994, 24:63-69). This polymorphism, 
combined with the ACE-DD polymorphism, has been shown to correlate strongly with the 
risk of myocardial infarction (Tiret et al, Lancet, 1994, 344:910-913). 

5 Kndothelin (ET> Regulation of Ca rdiovascular Physiology 

Endothelin is a potent vasoconstrictive peptide characterized by long 
lasting action. It was first discovered as a vasoconstricting factor in conditioned medium 
(Hickey et al, Am. J. Physiol., 1985, 248:C550), and subsequently purified and 
characterized (Yanagisawa et al, Nature, 1988, 332:41 1). ET is produced as 
1 0 preproendothelin, which is cleaved after removal of the signal sequence by an 

endopeptidase, followed by cleavage with endothelin converting enzyme (Xu et al., Cell, 
1994, 78:473: Shimada et al., J. Biol. Chem., 1994, 269:18274). Analysis of the human 
ET gene has revealed the existence of two additional ET-like peptides expressed in various 
tissues, termed ET-2 and ET-3 (Inoue et al, Proc. Natl. Acad. Sci. USA, 1989, 86:2863). 
1 5 The first endothelin was accordingly termed ET- 1 . 

One of the important discoveries following characterization of ETs was the 
discovery of two ET receptors, ET A and ET B (Arai et al, Nature, 1990, 348:730; Sakumi 
et al, Nature, 1990, 348:782). Both belong to the family of heptahelical G-protein 
coupled receptors. There is 68% amino acid identity between the two receptor subtypes. 
2 0 ET A exists as a single copy gene located on human chromosome 4 (Hosoda et al, J. Biol. 
Chem., 1992, 267:18797; Cyr et al, Biochem. Biophys. Res. Commun., 1991, 181:184). 
ET B exists as a single copy gene located on human chromosome 13 (Arai et al, J. Biol. 
Chem., 1993, 268:3463-70), although a splice variant of ET B has been found (Shyamala 
et al, Cell. Mol. Biol. Res., 1994, 40:285-96). The cDNA sequence of ET A has been 
2 5 deposited with GenBank with accession number S57498. 

In the early stage of ET research, a great number of pharmacological 
studies suggested that the responses to ETs could be divided into two groups according to 
the pharmacological potency of the three peptides. Indeed, these two receptors, ET A and 
ET B , are distinct in their ligand binding affinity and distribution in tissues and cells. ET A 
30 has a high affinity to ET- 1 and ET-2, but a low affinity to ET-3 . ET B has equally potent 
affinities to all three endogenous ETs. ET A exists on smooth muscle and mediates 
vasoconstriction. In contrast, ET B exists on endothelium and mediates the release of 
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relaxing factors such as nitric oxide and prostacycline. However, several reports 
demonstrated that ET B on some vascular smooth muscle also mediated vasoconstriction. 

Cloning of the ET receptor gene facilitated the development of ET-receptor 
antagonists, such as BE-18257B, and BQ-123 and FR139317, two derivatives of 
5 BE-18257B (See Masaki, Cardiovascular Res., 1998, 39:530). Many selective and 
non-selective antagonists for ET A and ET B have emerged. 

Although the pathophysiological role of ET is still unclear, ET antagonists 
demonstrated significant beneficial effects in pathological conditions, including congestive 
heart failure, pulmonary hypertension, cerebrovascular spasm after subarachnoid 
10 hemorrhage, acute renal failure, and essential hypertension. ET or ET receptor knockout 
mice have also provided important information regarding the physiological and 
pathophysiological significance of ET (Masaki, supra). In particular, mice with a 
knockout ofET-3 or the ET B receptor genes exhibit phenotypic changes that resemble 
Hirschsprung's disease, a human hereditary syndrome associated with a missense mutation 
15 of the ET B gene (Pfiffenberger, et aL, Cell, 1994, 79:1257). 

Despite advances in understanding the role of the endothelin pathway in the 
treatment of cardiovascular diseases, questions remain. Not all experimental models of 
hypertension respond to endothelin antagonists, and it remains a uncertain whether 
endothelin antagonists improve cardiac structure and function beyond the benefits of blood 
2 0 pressure reduction (Moreau, Cardiovascular Res., 1998, 39:534). Thus, there is a need in 
the art for a reliable and effective means for predicting whether endothelin antagonists will 
be effective for treating hypertension in a given individual. 

P-Adrenergic Receptors (ft- Adrenoceptors) 

2 5 The adrenoceptors fall into three major groups, a,, a 2 , p, within each of 

which further subtypes can be distinguished pharmacologically (Lullmann, et al in Color 
Atlas of Pharmacology, New York, 1993). Adrenergic receptors are all G-protein linked. 
They are involved in regulation of the cardiovascular system, and in the control of 
metabolic activity, e.g., insulin secretion and glucose release. They also mediate 

3 0 constriction or relaxation of smooth muscle cells in the respiratory, gastrointestinal, and 

genitourinary tracts (Berne and Levy, Principles of Physiology (2 nd Ed.), Mosby-Year 
Books, Inc., 1996, pp. 691-696). 
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Adrenoceptors are targets for epinephrine and norepinephrine, which are 
representatives of the family of monoamine neurotransmitters. Epinephrine has equally 
high affinity for all a- and p-receptors while norepinephrine differs from epinephrine by 
its low affinity for p 2 -receptors (The Biochemical Basis of Neuropharmacology, (7"' Ed.) 
5 New York, 1996, pp. 226-292). The adrenoceptors themselves interact preferentially with 
three different classes of G-proteins: G s (P-adrenoceptors) mediating activation of 
adenylate cyclase, G, (^-adrenoceptors) mediating inhibition of adenylate cyclase, and G q 
(a.-adrenoceptors) mediating activation of phospholipase C (Hieble et al, J. of Med. 
Chem., 1995, 38:3415-3444). 
! o The pharmacological interest of adrenoceptors is mainly for the treatment 

of cardiovascular diseases, e.g.. through the development of P-antagonists, ^-antagonists 
and cc-agonists to treat hypertension, but they are also considered important for the 
treatment of asthma (P 2 -agonists). 

The p 2 -adrenergic receptor is expressed on a number of cell types, e.g., 
1 5 bronchial smooth muscle, where its activation results in relaxation and bronchial 
dilatation. These receptors are also being expressed on epithelial cells, vascular 
endothelium, alveolar walls, immune cells, and presynaptic nerve terminals (Liggett, Am. 
J. Respir. Crit. Care. Med., 1997, 156:S156-S162). Cardiac cells express mainly p,-, but 
also a small fraction of p 2 -adrenoceptors (Collins et al, Biochimica et Biophysica Acta, 
2 0 1993, 1 172: 171-174). P,-adrenoceptors are also expressed in brain and pineal gland. 

^-Adrenoceptor Function 
The P,- and p 2 -adrenergic receptors are coupled to a G s -protein complex, 
which activates adenylate cyclase. Agonist binding to the p, -receptor, located in cardiac 

2 5 muscle cells, mediates increased contractility and cardiac output (Lullmann, supra). 

Agonist binding to the p 2 -receptor, located in peripheral vascular arteries, mediates 
vasodilation by increasing the amount of cAMP, and thereby inhibiting activation of 
myosin kinase, which is necessary for smooth muscle cell constriction {ibid.) Activation 
of cAMP in cardiac cells by agonist binding to both P,- and P- 2 -adrenoceptors activates the 

3 0 cAMP-dependent protein kinase (PKA) (Castellano and Bdhm, Hypertension, 1 997, 

29:715-722). P-adrenoceptors also regulate the control of melatonin production in the 
pineal gland, by the cAMP activation of one of the enzymes (5-HT-N-acetyl transferase) 
involved in the synthesis of melatonin (Collins et al., supra). 
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The p r receptors mediate increased conversion of glycogen to glucose 
(glycogenosis) in both the liver and skeletal muscle (H. Liillmann, et aL, supra), and 
stimulate influx of potassium into muscle cells to prevent hyperkalemia (Berne and Levy, 
supra). 

5 The P-receptors are regulated on the protein level by desensitization. The 

initial desensitization process results from the phosphorylation of serine and threonine 
residues in the cytoplasmic tail or third intracellular loop by several protein kinases, 
including pARK and PKA (Hieble et ah, supra). pARK phosphorylates specific serine or 
threonine residues in the C-terminal of receptors that are occupied by an agonist. The 
10 phosphorylation triggers binding of the cytostolic protein P-arrestin and results in the 

uncoupling from G S£X . PKA is activated by cAMP and phosphorylates the P 2 -adrenoceptor 
by a relatively slow process. The phosphorylated receptor loses the ability to activate G s 
(Castellano and Bohm, supra). Prolonged interaction of agonists with adrenoceptors 
generally results in receptor desensitization. 

15 

^-Adrenoceptor Gene Structure 
The human p r adrenoceptor gene is located on the long arm of chromosome 
10, the same chromosome as for the a 2A -adrenoceptor gene. The coding sequence of this 
gene is deposited with GenBank, accession number X69168. The regulatory region is also 
2 0 deposited with GenBank, accession number J030 19. It codes for an intronless gene 

product of 1431 base pairs (Hall, Thorax, 1996,51:351-353). Both the promoter and the 
coding region of the gene are rich in G and C residues, which make up greater than 70% of 
the bases. The promoter does not contain any paired consensus TATA box and CAAT 
box elements but instead clusters with an inverted CAAT box and S? ] or AP-2 binding 

2 5 motifs. This type of receptor, reminiscent of "housekeeping genes", has been described 

for other G-protein coupled receptors as well (Collins et aL, supra). 

The human p 2 -adrenoceptor gene is located on the long arm of chromosome 
5, the same chromosome as the oc m -adrenoceptor gene. The coding sequence has been 
deposited with GenBank, with accession numbers Ml 5 169, J02728, or Ml 61 06. The 

3 0 regulatory region sequence is also deposited with GenBank, accession number Y00106. It 

codes for an intronless gene product of 1239 base pairs (Hall, supra). The promoter region 
is 200-300 bases 5' of the translation initiation codon, and it can form strong secondary 
structures due to high G-C content. There are two TATA boxes (separated by roughly 10 
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bp) and a CAAT box located approximately 30 and 80 base pairs upstream, respectively, 
from the mRNA start region. 

^-Adrenoceptor Gene Regulation 

There are some regulatory regions identified in the promoter region of the 
P, -adrenoceptor gene: a cAMP response element (CRE), a consensus thyroid response 
element (TRE), and a glucocorticoid response element (GRE). This is consistent with the 
evidence that both thyroid hormone and corticosteroids affect adrenergic sensitivity in 
both heart and adipose tissue. The CRE region might have a self-regulatory function, as 
has been shown for the p 2 -adrenoceptor gene (Collins et al., supra). 

There are several regulatory domains in the 5' flanking region of the 
P 2 -adrenoceptor. Among these is a cAMP-responsive element (CRE), which is recognized 
and stimulated by a phosphoprotein called CRE binding protein (CREB). CREB is 
partially under the control of PKA-dependent phosphorylation processes. This is seen as 
an increase in P 2 -adrenoceptor mRNA level in the early phase after exposure to P-agonists. 
However, the level of mRNA is decreased after prolonged exposure to agonists, probably 
mediated by a shortening of mRNA half-life (Castellano and Bohm, supra). It has also 
been shown that transcription of the P 2 -adrenoceptor gene is upregulated by stimulation 
with glucocorticoids in a variety of tissues (Collins et al., supra). In the 3' flanking region 
there are sequences homologous to glucocorticoid response elements. These might be 
responsible for the increased expression of p 2 adrenoceptor observed in transfected cells 
after treatment with hydrocortisone (Emorine and Marullo, Proc. Natl. Acad. Sci. USA, 
1987, 84:6995-6999). 

^-Adrenoceptor Protein Structure 
The proposed model for P-adrenoceptors is like most of the G-protein 
binding receptors, a seven a-helical transmembrane structure, where the seven a-helices 
are radially arranged around a central "pore", in which the receptor ligands bind. The 
P-adrenoceptors have an extracellular glycocylated N-terminus, and an intracellular 
C-terminus. The P r receptor consists of 477 amino acids; the P 2 -receptor consists of 413 
amino acids. 

The overall amino acid identity of human p r and p 2 - adrenoceptors is only 
54%. However, it is likely that the pharmacological differences between P 2 -receptors and 
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P, -receptors are due to subtle changes in orientation of the primary binding sites, resulting 
in a slightly different binding site rather than to specific amino acid substitutions (Hieble 
et a/., J. Med. Chem. 1995, 38:3415-3444). 

Site-directed mutagenesis has demonstrated that an aspartic acid residue, 

5 Asp- 1 13, located in the third transmembrane-spanning helix, and two serine residues, 
Ser-204 and Ser-207, are required for full agonist binding to the p 2 -adrenoceptor. The 
P, -adrenoceptor contains identical amino acid residues located in corresponding positions 
to those shown to be important for agonist binding to the P 2 -adrenoceptor. Another 
aspartic acid residue, Asp-79 ; located in the second a-helix of both p-receptors is highly 

1 0 conserved in G-protein coupled receptors (Hieble et al, supra). Ser-3 1 9 has a potential 
role in agonist binding to the P 2 -adrenoceptor. 

Mutation of Tyr-350, located in the cytoplasmic tail of the p 2 -receptor, 
interferes with coupling of the receptor to G s (Hieble et al. 9 supra). Also, palmitoylation 
of Cys-341 in the C-terminal enables the P 2 -adrenoceptor to form a fourth 

1 5 intracytoplasmic loop, which increases the ability of the agonist-bound receptor to mediate 
adenylyl cyclase stimulation (Strosberg, Preotein Science, 1993, 2:1 198-1209). 

^-Adrenoceptors as Prim Targets 
No cause of disease can be identified in 80-90% of patients with 
2 0 hypertension. They have so-called essential hypertension, which affects 5-10% of the 
general population, and is the most common cause of disease in developed countries ( J. 
Axford, Medicine, Blackwell Science Ltd., 1996, 10.1 19-10.130). Betablockers have been 
widely used in the treatment of hypertension. They are particularly useful for the 
treatment of juvenile hypertension with tachycardia and high cardiac output. Betablockers 

2 5 or beta-adrenergic blockers were first introduced as a treatment for essential hypertension 

in 1964, and are still recommended as first choice because the cost for betablockers is low, 
which improves patient compliance. They act by binding to p, -receptors on the cardiac 
smooth muscle cells, which leads to decreased cardiac output. Most betablockers are not 
specific P, -receptor antagonists but binds to P 2 -receptors as well. The binding to 

3 0 p.-receptors gives the opposite of the desired effect though inhibition of p 2 -receptors leads 

to vasoconstriction. This gives a side effect with cold hands and feet because most of the 
P 2 -receptors are located in the peripheral vascular arteries. 
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They have also been known to cause bronchospasms as well as some 
central nervous system side effects (nightmares, somnolence). They decrease insulin 
secretion, which makes them inappropriate to treat hypertensives with diabetes mellitus, 
and they can cause heart failure and peripheral artery obstructive disease (Velaseco and 
5 Rodrigues, Journal of Human Hypertension 10, Suppl. 1, S77-S80, 1996). 

P-adrenoceptor agonists, such as dopamine and dobutamine are used to 
stimulate myocardial P,-adrenoceptors in the acute management of congestive heart 
failure. They act by increasing contractility and cardiac output. 

A lot of different p 2 -agonists are used in the treatment of asthma. They 
10 exert their primary effect on the p 2 -adrenergic receptor of bronchial smooth muscle, 
resulting in relaxation and bronchial dilatation. They also protect against 
bronchoconstrictor challenge (Hall, supra). 

Thus, there is a clear need in the art for an improved understanding of the 
effects of betablockers on different subjects, and to predict which patients will have a 
15 better response to treatment with betablockers. 

Need for Effective Cardiovascular Status Assessment 
The high morbidity and mortality associated with cardiovascular disease 
demonstrate a need in the art for methods and compositions that allow the determination 
2 0 and/or prediction of the therapeutic regimen that will result in the most effective treatment 
outcome in a patient suffering from cardiovascular disease. This includes identification of 
individuals who are more or less responsive to particular therapeutic regimens, including, 
e.g., particular drugs that are conventionally used to treat cardiovascular disease. 

Furthermore, the heterogeneity in responses to cardiovascular therapies 

2 5 emphasizes a need for another approach to rational drug development. In particular, 

populations that are identified as non-responsive to a particular therapeutic regimen can be 
identified for development and testing of alternative regimens. Thus, effective treatment 
regimens could be developed for a larger percentage of the affected population. 

In summary, there is a need to reduce or eliminate trial and error in 

3 0 selecting a therapeutic regimen for a particular individual. It would be desirable instead to 

predict whether a given individual will be responsive to, e.g., a particular class of drugs or 
even to a particular drug or whether he/she is likely to suffer from adverse reactions or 
side-effects. 
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There is also a need in the art for methods and compositions that allow the 
identification of individuals having a predisposition to cardiovascular disease, such as, 
e.g., myocardial infarction, hypertension, atherosclerosis, and stroke, to facilitate early 
intervention and disease prevention. 

The present invention addresses these and other needs in the art by 
providing polymorphisms and polymorphic patterns that are characteristic of 
cardiovascular status, and by using these polymorphisms and patterns to prescribe or to 
develop more effective treatments or to assist in diagnosis. 

Citation of any reference in this application should not be construed as an 
admission that the reference is prior art to the invention. Each cited reference is 
incorporated herein by reference in its entirety. 

SUMMARY OF THE INVENTION 

In one aspect, the present invention provides reagents and methods for 
predicting whether a particular therapeutic regime (such as a specific drug, a class of drugs 
or any other therapeutic regime, pharmacological or not) would be effective in improving 
a cardiovascular condition in a human individual, or would be ineffective for that purpose, 
or its use would be associated with adverse reactions or undesirable side-effects. 

A particular advantage of the invention is that one or more polymorphic 
markers provide a basis for predicting the outcome of a treatment regimen. By comparing 
a polymorphic pattern of a subject who requires treatment for a cardiovascular disorder, 
for example hypertension, with a reference pattern previously established to correlate with 
responsivity to the treatment regimen, a physician can predict whether a treatment plan, 
such as administration of an ACE inhibitor, is likely or not to be effective before 
subjecting the subject to the treatment plan. For example, a comparison of the test 
polymorphic pattern from an individual with reference polymorphic patterns of individuals 
exhibiting differing responses to a particular therapeutic intervention can be used to 
predict the type or degree of responsivity of the individual to such intervention. The 
present invention thus represents a significant breakthrough in treating cardiovascular 
pathologies in that it reduces or eliminates trial and error in selecting a treatment for a 
particular individual cardiovascular patient. 

An additional advantage of the invention derives from the ability to 
eliminate subjects from clinical trials who are predictably non-responsive, or at risk for an 
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adverse response, to a particular treatment regimen. Furthermore, adverse results in an 
early trial can be evaluated to identify polymorphic patterns, so that the adverse results can 
be correlated with a sub-population of the test population permitting exclusion of such 
sub-population from the treatment group. The invention may thus ensure that a beneficial 
5 drug can be approved for use in the appropriate population, and decrease the number of 
required patients and therefore the duration and cost of clinical trials. It may also lead to 
identification of another subgroup which can be the target for development of another 
therapeutic regimen. 

All of the foregoing applications within the scope of the invention can be 
1 0 deemed to be assessments of an individual's cardiovascular status, as the term is broadly 
defined below. 

The foregoing methods of the invention are carried out by comparing a test 
polymorphic pattern established by at least one polymorphic position within a gene 
encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
1 5 endothelin receptor, and P-adrenoceptor with a polymorphic pattern of a population of 

individuals exhibiting a predetermined responsivity to the regimen (reference pattern). If 
the test pattern matches the reference pattern, there is a statistically significant probability 
that the individual has the same cardiovascular status as that correlated with the reference 
pattern. 

2 o The polymorphic pattern preferably consists of at least two (and more 

preferably at least three) polymorphic positions, at least one of which is in the gene 
encoding a polypeptide from the group consisting of ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, and P-adrenoceptor, and a 
second polymorphism in a gene encoding a polypeptide selected from the group consisting 

2 5 of ACE, AT 1 , AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 

receptor, and p-adrenoceptor. 

Additionally, the invention provides methods for assessing whether a 
particular individual has a genetic predisposition to a cardiovascular pathology. This 
aspect of the invention comprises comparing a test polymorphic pattern established by at 

3 0 least one and preferably at least two and most preferably at least three polymorphic 

positions within a gene encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 
angiotensin II receptor, endothelin receptor, and p-adrenoceptor in conjunction with one or 
more, and preferably two or more, other polymorphic positions in ACE, ATI , AGT, renin, 
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aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or p- 
adrenoceptor, with a polymorphic pattern of individuals exhibiting a predisposition to a 
cardiovascular syndrome. The conclusion drawn depends on whether the individual's 
polymorphism pattern matches the reference pattern. 
5 The invention also provides an isolated nucleic acid having a sequence 

corresponding to part or all of the gene encoding ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, and P-adrenoceptor, the 
nucleic acid comprising a polymorphism in the ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, and P-adrenoceptor gene. In 
1 0 preferred embodiments, the polymorphism, in combination with one or more other 

polymorphisms in the sequence of the same gene or a gene encoding a protein selected 
from the group consisting of human ACE, ATI, AGT, renin, aldosterone synthase, type-2 
angiotensin II receptor, endothelin receptor, and P-adrenoceptor, is predictive of a 
particular type or level of responsivity to a given treatment, or indicates a predisposition to 
1 5 one or more clinical syndromes associated with cardiovascular disease, or both. 

The isolated polymorphisms according to the invention (which are 
described using the numbering indicated in Table 1 below) include without limitation: 

Nucleic acids encoding renin having one or more polymorphic positions in 
the last exon of the gene (exon 10), which is a cytosine to thymine transition that creates a 
2 0 premature stop codon at position 387. 

Nucleic acids encoding aldosterone synthase promoter at position -344. 
Nucleic acids encoding p-adrenergic receptor- 1 regulator region, at 
positions 2238, 2440, 2493, 2502, 2577, 2585, 2693, 2724, and 2757, as numbered in 
GenBank accession number X69168. In preferred embodiments, the bases at specific 

2 5 positions are 2238 G, 2238 A, 2577 C, 2257 T, 2757 A, and 2757 G. 

Nucleic acids encoding p-adrenergic receptor- 1 coding region, at positions 
231, 758, 1037, 1251, 1403, and 1528, as numbered in GenBank accession number 
J03019. In preferred embodiments, the bases at specific positions are 23 1 A, 23 1 G, 125 1 
C, 1251 G, 1403 A, 1403 G, 1528 C, and 1528 A. 

3 o Nucleic acids encoding p-adrenergic receptor-2 regulatory region, at 

positions 932,934, 987, 1006, 1120, 1221, 1541, and 1568, as numbered in GenBank 
accession number M 1 5169, or J02728, or Ml 6 106. In preferred embodiments, the bases 
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at specific positions are 934 A ? 934 G, 987 C, 987 G, 1006 A, 1006 G, 1 120 C, 1 120 G, 
1221 C, 1221 T, 1541 C, 1541 T, 1568 C, and 1568 T.. 

Nucleic acids encoding p-adrenergic receptor- 2 coding region, at positions 
839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, and 2070, as numbered in GenBank 
5 accession number Y00106. In preferred embodiments, the bases at specific positions are 
839 A, 839 G, 872 C, 872 G. 1045A, 1045 G, 1284 C, 1284 T, 1316 A, 1316 C, 1846 C, 
1846 G, 2032 A, 2032 G, 2068 no insert, 2068 G, 2068 C, 2070 no insert, and 2070 C. 

Nucleic acids encoding endothelin-A receptor at positions 969, 1005, 1 146, 
and 2485, as numbered in GenBank accession number S57498. In preferred embodiments, 
10 the bases at specific positions are 969 C, 969 T, 1005 A, 1005 G, 1 146 A, 1 146 G, 2485 T, 
and 2485 C. 

Nucleic acids comprising polymorphisms present in other genes, which 
can be used in combination with a polymorphism from a gene encoding a polypeptide 
selected from the group consisting of ACE, ATI, AGT, renin, aldosterone synthase, type-2 
15 angiotensin II receptor, endothelin receptor, and p-adrenoceptor to establish a 

polymorhism pattern, have been disclosed in International Patent Application No. 
PCT/IB98/00475, and include: 

(i) Nucleic acids encoding ACE having one or more polymorphic positions 
at the position in the regulatory region numbered 5106; positions in the coding region 

20 numbered 375, 582, 731, 1060, 2741, 3132, 3387, 3503, and 3906; and position 1451 all 
positions as numbered in GenBank entry X62855. In preferred embodiments, the 
sequences at the polymorphic positions in the ACE regulatory region are one or more of 
5 106C and 5 106T; and the sequences at the polymorphic positions in the coding region are 
one or more of 375A, 375C, 582C, 582T, 731 A, 73 1G, 1060G, 1060A, 2741G, 2741T, 

2 5 3132C, 3132T, 3387T, 3387C, 3503G, 3503C, 3906G, and 3906A. The invention also 
encompasses a nucleic acid encoding a deletion of nucleotides 1451-1783 as numbered in 
GenBank entry X62855. 

(ii) Nucleic acids encoding AGT having one or more polymorphic positions 
at positions in the regulatory region numbered 395, 412, 432, 449, 692, 839, 1007, 1072, 

30 and 1204; positions in the coding region numbered 273, 912, 997, 1116, and 1 174; and 
position 49 as numbered in GenBank entry M24688. In preferred embodiments, the 
sequences at the polymorphic positions in the AGT regulatory region are one or more of 
395T, 395A, 412C, 412T, 432G, 432A, 449T, 449C, 692C, 692T, 839G, 839A, 1007G, 
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1007A, 1072G, 1072A, 1204C, and 1204A; the sequences at the polymorphic position in 
the coding region are one or more of 273C, 273T, 912C, 912T, 997G, 997C, 1 1 16G, 
1 1 16A, 1 174C and 1 174 A; and the sequence at position 49 in GenBank entry M24688 is 
either A or G. 

5 (iii) Nucleic acids encoding ATI having one or more polymorphic 

positions at positions in the regulatory region numbered 1427, 1756, 1853, 2046, 2354, 
2355, and 24 1 5; and the position in the coding region numbered 449. In preferred 
embodiments, the sequences at the polymorphic positions in the ATI regulatory region are 
one or more of 1427 A, 1427T, 1756T, 1756A, 1853T, 1853G,2046T, 2046C, 2354A, 
10 2354C, 2355G, 2355C, 241 5A and 2415G; and the sequences at the polymorphic positions 
in the coding region are one or more of 449G, 449C, 678T, 678C, 11 67 A, 1167G, 1271A, 
and 1271C. 

The invention also encompasses libraries of isolated nucleic acid 
sequences, such as arrays on a solid surface, wherein each sequence in the library 

1 5 comprises a polymorphic position in the gene encoding ACE, ATI , AGT, renin, 
aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or p- 
adrenoceptor and other polymorphic positions in the other genes, including without 
limitation the polymorphic positions and sequences disclosed herein. Also provided are 
nucleic acid probes that hybridize specifically to the identified polymorphic positions; 

2 0 peptides and polypeptides comprising polymorphic positions; and polymorphism-specific 
antibodies, i.e., sequence-specific antibodies that bind differentially to polymorphic 
variants of the foregoing genes, that can be used to identify particular polymorphic 
variants. 

In another aspect, the invention provides kits for the determination of 
2 5 polymorphic patterns in an individual's genes. The kits comprise a means for detecting 

polymorphic sequences, including without limitation oligonucleotide probes that hybridize 
at or adjacent to the polymorphic positions and polymorphism-specific antibodies. 

In yet another aspect, the invention provides nucleic acid and polypeptide 
targets for use in screening methods to identify candidate cardiovascular drugs. Nucleic 
30 acid targets may be, e.g., DNA or RNA and are preferably at least about 10, and most 
preferably at least about 15, residues in length and comprise one or more polymorphic 
positions in a gene encoding a polypeptide from the group consisting of ACE, ATI, AGT, 
renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, and p- 
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adrenoceptor. Peptide targets are at least about 5 amino acids in length and may be as 
large or larger than the full-length polypeptides. 



DETAILED DESCRIPTION OF THE INVENTION 

The invention in based, in part, on the discovery that polymorphisms in 
certain genes in the RAAS, endothelin, and (i-adrenoceptors pathways define 
polymorphism patterns that correlate with cardiovascular status. Most significantly, by 
comparing a test individual's polymorphism pattern with a reference polymorphism 
pattern, which is a polymorphism pattern from a population of individuals with known 
cardiovascular status, one is able to predict whether the test individual has an increased 
likelihood to share the same cardiovascular status as that correlated with the reference 
polymorphism pattern. In particular, particular patterns correlate with responsiveness to 
ACE inhibitors, non-responsiveness to ACE inhibitors, and predisposition to 
cardiovascular diseases or dysfunctions, including myocardial infarction and stroke. 

The invention provides a powerful predictive tool for clinical testing and 
treatment of cardiovascular disease. For clinical testing, the present invention permits 
smaller, more efficient clinical trials by identifying individuals who are likely to respond 
poorly to a treatment regimen and reducing the amount of uninterpretable data. By 
evaluating a test individual's polymorphism pattern, a physician can prescribe a 
prophylactic or therapeutic regimen customized to that individual's cardiac status. 
Adverse responses to particular therapies can be avoided by excluding those individuals 
whose cardiovascular status puts them at risk for that therapy. Appropriate changes in 
lifestyle, including diet, environmental stress, and exercise levels can be prescribed for 
individuals whose test polymorphic pattern matches a reference pattern that correlates with 
increased predisposition to cardiovascular disease. 

Definitions 

"Cardiovascular status" as used herein refers to the physiological status of 
an individual's cardiovascular system, as reflected in one or more status markers or 
indicators including genotype. Cardiovascular status shall be deemed to include without 
limitation not only the absence or presence of a pathology or disease in one or more 
components of the individual's cardiovascular system and the individual's predisposition 
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to developing such a condition, but also the individual's responsivity, i.e.. the ability or 
inability of the individual to respond (positively or negatively) to a particular prophylactic 
or therapeutic regimen or treatment for a cardiovascular condition, such as a drug or a 
class of drugs. A negative response includes one or more adverse reactions and side 
5 effects. Status markers include without limitation clinical measurements such as, e.g., 
blood pressure, electrocardiographic profile, differentiated blood flow analysis, and the 
presence of increased levels of cellular proteins associated with a cardiovascular event. 
Examples of such proteins, also called diagnostic markers, which are important in cardiac 
events include myosin light chain, myosin heavy chain, myoglobin, troponin I, troponin T, 
L0 CK-MB, etc. (see U.S. Patents No. 5,604,105 and No. 5,744,358). Status markers 

according to the invention are assessed using conventional methods well known in the art. 
Also included in the evaluation of cardiovascular status are quantitative or qualitative 
changes in status markers with time, such as would be used, e.g., in the determination of 
an individual's response to a particular therapeutic regimen or of a predisposed 
1 5 individual's eventual development of a cardiovascular condition. 

Examples of cardiovascular syndromes that are included in the foregoing 
definition of cardiovascular status include diagnosis of, or predisposition to, one or more 
cardiovascular syndromes, such as, e.g., hypertension, acute myocardial infarction, silent 
myocardial infarction, unstable angina, stroke, and atherosclerosis. It will be understood 
2 0 that a diagnosis of a cardiovascular syndrome made by a medical practitioner encompasses 
not only clinical measurements but also medical judgment. 

"Responsivity", as used herein, refers to the type and degree of response an 
individual exhibits to a particular therapeutic regimen, i.e., the effect of a treatment on an 
individual. Responsivity breaks down into three major categories: therapeutic effect; no 
2 5 effect; and adverse effect. Naturally, there can be differing degrees of a therapeutic effect, 
e.g., between full elimination and partial elimination of symptomology. In addition, 
adverse effects, or side effects, may be observed even though the treatment is beneficial, 
i.e., therapeutically effective. Indeed, the present invention may permit identification of 
individuals with complex responsivity traits or patterns. 
30 a "predisposition to develop a cardiovascular syndrome" refers to an 

increased likelihood, relative to the general population, to develop a cardiovascular 
syndrome, as defined above. A predisposition does not signify certainty, and development 
of the syndrome may be forestalled or prevented by prophylaxis, e.g., adopting a modified 
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diet, exercise program, or treatment with gene therapy or pharmaceuticals. Naturally, an 
advantage of the present invention is that it permits identification of individuals who are, 
based on their genotype, predisposed to develop a cardiovascular syndrome, and for whom 
prophylactic intervention can be especially important. 

A "polymorphism" as used herein denotes a variation in the nucleotide 
sequence of a gene in an individual. Genes that have different nucleotide sequences as a 
result of a polymorphism are "alleles". A "polymorphic position" is a predetermined 
nucleotide position within the sequence. In some cases, genetic polymorphisms are 
reflected by an amino acid sequence variation, and thus a polymorphic position can result 
in location of a polymorphism in the amino acid sequence at a predetermined position in 
the sequence of a polypeptide. An individual "homozygous" for a particular 
polymorphism is one in which both copies of the gene contain the same sequence at the 
polymorphic position. An individual "heterozygous" for a particular polymorphism is one 
in which the two copies of the gene contain different sequences at the polymorphic 
position. 

A "polymorphism pattern" as used herein denotes a set of one or more 
polymorphisms, including without limitation single nucleotide polymorphisms, which 
may be contained in the sequence of a single gene or a plurality of genes. In the simplest 
case, a polymorphism pattern can consist of a single nucleotide polymbrphism in only one 
position of one of two alleles of an individual . However, one has to look at both copies of 
a gene. A polymorphism pattern that is appropriate for assessing a particular aspect of 
cardiovascular status (e.g., predisposition to hypertension) need not contain the same 
number (nor identity, of course) of polymorphisms as a polymorphism pattern that would 
be appropriate for assessing another aspect of cardiovascular status (e.g., responsivity to 
ACE inhibitors for control of hypertension). A "test polymorphism pattern" as used herein 
is a polymorphism pattern determined for a human subject of undefined cardiovascular 
status. A "reference polymorphism pattern" as used herein is determined from a 
statistically significant correlation of patterns in a population of individuals with pre- 
determined cardiovascular status. 

"Nucleic acid" or "polynucleotide" as used herein refers to purine- and 
pyrimidine-containing polymers of any length, either polyribonucleotides or 
polydeoxyribonucleotides or mixed polyribo-polydeoxyribo nucleotides. Nucleic acids 
include without limitation single- and double-stranded molecules, i.e., DNA-DNA, DNA- 
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RNA and RNA-RNA hybrids, as well as "protein nucleic acids" (PNA) formed by 
conjugating bases to an amino acid backbone. This also includes nucleic acids containing 
modified bases and non-naturally occurring phosphoester analog bonds, such as 
phosphorothioates and thioesters. The term nucleic acid molecule, and in particular DNA 
5 or RNA molecule, refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary forms. Thus, this term includes double-stranded 
DNA found, inter alia, in linear or circular DNA molecules {e.g., restriction fragments), 
plasmids, and chromosomes. In discussing the structure of particular double-stranded 
DNA molecules, sequences may be described herein according to the normal convention 
10 of giving only the sequence in the 5' to 3* direction along the nontranscribed strand of 
DNA (i.e., the strand having a sequence homologous to the mRNA). A "recombinant 
DNA molecule" is a DNA molecule that has undergone a molecular biological 
manipulation. 

As used herein, the term "oligonucleotide" refers to a nucleic acid, 
15 generally of at least 10, preferably at least 15, and more preferably at least 20 nucleotides, 
that is hybridizable to a genomic DNA molecule, a cDNA molecule, or an mRNA 
molecule encoding a gene, cDNA, mRNA, or other nucleic acid of interest. 
Oligonucleotides can be labeled, e.g., with 32 P-nucleotides or nucleotides to which a label, 
such as biotin, has been covalently conjugated. In one embodiment, a labeled 
2 0 oligonucleotide can be used as a probe to detect the presence of a nucleic acid. In another 
embodiment, oligonucleotides (one or both of which may be labeled) can be used as PCR 
primers, either for cloning full length or a fragment of a gene of interest, or to detect the 
presence of nucleic acids encoding the gene of interest. In a further embodiment, an 
oligonucleotide of the invention can form a triple helix with a double stranded sequence of 
2 5 interest in a DNA molecule. In still another embodiment, a library of oligonucleotides 
arranged on a solid support, such as a silicon wafer or chip, can be used to detect various 
polymorphisms of interest. Generally, oligonucleotides are prepared synthetically, 
preferably on a nucleic acid synthesizer. Accordingly, oligonucleotides can be prepared 
with non-naturally occurring phosphoester analog bonds, such as thioester bonds. 
30 An "isolated" nucleic acid or polypeptide as used herein refers to a nucleic 

acid or polypeptide that is removed from its original environment (for example, its natural 
environment if it is naturally occurring). An isolated nucleic acid or polypeptide contains 
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less than about 50%, preferably less than about 75%, and most preferably less than about 
90%, of the cellular components with which it was originally associated. 

A nucleic acid or polypeptide sequence that is "derived from" a designated 
sequence refers to a sequence that corresponds to a region of the designated sequence. For 
5 nucleic acid sequences, this encompasses sequences that are identical to or complementary 
to the sequence. 

A "probe" refers to a nucleic acid or oligonucleotide that forms a hybrid 
structure with a sequence in a target nucleic acid due to complementarity of at least one 
sequence in the probe with a sequence in the target nucleic acid. Generally, a probe is 

10 labeled so it can be detected after hybridization. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, 
such as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid 
molecule can anneal to the other nucleic acid molecule under the appropriate conditions of 
temperature and solution ionic strength (see Sambrook et ah, 1989, Molecular Cloning: A 

15 Laboratoiy Manual, Second Edition, Cold Spring Harbor Laboratory Press: Cold Spring 
Harbor, New York). The conditions of temperature and ionic strength determine the 
"stringency" of the hybridization. For preliminary screening for homologous nucleic 
acids, low stringency hybridization conditions, corresponding to a T m of 55 °C, can be 
used, e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% formamide, 5x 

2 0 SSC, 0.5% SDS). Moderate stringency hybridization conditions correspond to a higher 
T m , e.g., 40%o formamide, with 5x or 6x SCC. High stringency hybridization conditions 
correspond to the highest T ni , e.g., 50% formamide, 5x or 6x SCC. Hybridization requires 
that the two nucleic acids contain complementary sequences, although depending on the 
stringency of the hybridization, mismatches between bases are possible. The appropriate 

2 5 stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the 

degree of complementation, variables well known in the art. The greater the degree of 
similarity or homology between two nucleotide sequences, the greater the value of T m for 
hybrids of nucleic acids having those sequences. The relative stability (corresponding to 
higher T m ) of nucleic acid hybridizations decreases in the following order: RNA:RNA, 

3 0 DNA:RNA, DNA:DNA. For hybrids of greater than 100 nucleotides in length, equations 

for calculating T m have been derived (see Sambrook et al, supra, 9.50-9.5 1). For 
hybridization with shorter nucleic acids, i.e., oligonucleotides, the position of mismatches 
becomes more important, and the length of the oligonucleotide determines its specificity 
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(see Sambrook et al. , supra, 11.7-11 .8). A minimum length for a hybridizable nucleic acid 
is at least about 10 nucleotides: preferably at least about 15 nucleotides; and more 
preferably the length is at least about 20 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" 
5 refers to a T n , of 55 °C, and utilizes conditions as set forth above. In a preferred 
embodiment, the T m is 60°C; in a more preferred embodiment, the T n , is 65 °C. In a 
specific embodiment, "high stringency" refers to hybridization and/or washing conditions 
at 68°C in 0.2XSSC, at 42 °C in 50% formamide, 4XSSC, or under conditions that afford 
levels of hybridization equivalent to those observed under either of these two conditions. 
10 a "gene" for a particular protein as used herein refers to a contiguous 

nucleic acid sequence corresponding to a sequence present in a genome which comprises 
(i) a "coding (or transcribed) region," which comprises exons (i.e., sequences encoding a 
polypeptide sequence, or "protein-coding" or "transcribed sequences"), introns, sequences 
at the junction between exons and introns, and 5' and 3' untranslated regions (uTRs); and 
15 (ii) regulatory sequences, which flank the coding region at both 5' and 3' termini. For 
example, the "ACE gene" as used herein encompasses the regulatory and coding regions 
of the human gene encoding angiotensin converting enzyme. Similarly, the "AGT gene" 
encompasses regulatory and coding regions of the human gene encoding angiotensinogen 
and the "ATI gene" encompasses regulatory and coding regions of the human gene 
2 0 encoding type I angiotensin II receptor. Typically, regulatory sequences according to the 
invention are located 5' (i.e., upstream) of the coding region segment. The reference 
sequences, obtained from GenBank, which were used in practicing the present invention 
are shown in Table 1 . 
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Table 1 . GenBank Accession Numbers 





Abbreviation 


Compared master 
sequence 


Numbering according to 
sequence entry in GenBank 


SEQ 

ID 

NO. 


5 


AGT Regulatory | 
Regions 


X15323 


X15323 


122 




AGT Coding 
Region 


M24686 (exon 2) 
M24687 (exon 3) 
M24688 (exon 4) j 

X62855 (intron16) 


Protein-coding sequences from 
exon 2-5 were spliced together 
as described in the GenBank 
entries 

Nucleotide 1 is assigned to the 
first nucleotide of the initiator 
methionine codon. 

X62855 


123 
124 
125 
126 

127 




ACE Regulatory 
Region 


X94359 


X94359 


128 


10 


ACE Coding 
Region 


J04144 


J04144 

Nucleotide 1 is assigned to the 
first nucleotide of the initiator 
methionine codon. 


129 




ATI Regulatory 
Region 


U07144 


U07144 


130 


15 


ATI Coding 
Region 


S80239 (exon 3) 
S77410(exon 5) 


The protein-coding sequence of 
S80239 was spliced to position 
288 of entry S77410. 
Nucleotide 1 is assigned to the 
first nucleotide of the initiator 
methionine codon in entry 
S80239. 


131 
132 




Renin 




Ml 0030, X34914, X01391, 
X01734 


133 




Aldosterone 
synthase 




D13752 


134 




AT2 




U 10273 


135 


20 


P-adrenoceptorl 
Regulatory Region 
(B1P) 




X69168/g28421 


136 


25 


P-adrenoceptorl 
Coding Region 
(B1R) 




J03019/gl7899 


137 
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p-aorenocepiorz 
Regulatory Region 
(B2P) 




M15169 

J02728 

M16106 


138 




P-adrenoceptor2 




Y00106/g29370 


139 


5 


Coding Region 










(B2R) 










EndothelinA 
(ET A ) Receptor 
Coding 




S57498 


140 



10 

The present inventors have surprisingly and unexpectedly discovered the 



existence of genetic polymorphisms within the human gene encoding ACE, ATI , AGT, 
renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or p- 
adrcnoceptor which, singly or in combination, can be used to assess cardiovascular status, 
1 5 depending on which component of cardiovascular status is under evaluation. In 
accordance with the invention, the polymorphic pattern of the gene, alone or in 
combination with other genes encoding renin, ACE, AGT, ATI, AT2, aldosterone 
synthase, endothelin receptor, and p-adrenergic receptors 1 and 2 in an individual can 
predict the responsivity of the individual to particular therapeutic interventions and serve 
2 0 as an indicator of predisposition to various forms of cardiovascular disease. The invention 
provides methods for assessing cardiovascular status by detecting polymorphic patterns in 
an individual. The present invention also provides isolated nucleic acids derived from the 
gene which comprise these polymorphisms, including probes which hybridize specifically 
to polymorphic positions and primers that amplify the region of the gene in which the 

2 5 polymorphism is located; isolated polypeptides and peptides comprising polymorphic 

residues; and antibodies which specifically recognize ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, or p-adrenoceptor 
polypeptides containing one or more polymorphic amino acids. 

3 0 Methods for Assessing Cardiov ascular Status 

The present invention provides diagnostic methods for assessing 
cardiovascular status in a human individual. The methods are carried out by comparing a 
polymorphic position or pattern ("test polymorphic pattern") within the individual's gene 
encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
3 5 endothelin receptor, or p-adrenoceptor with the polymorphic patterns of humans 
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exhibiting a predetermined cardiovascular status ("reference polymorphic pattern"). If the 
cardiovascular status is the prediction of responsivity to a therapy, a single polymorphic 
position can provide a pattern for comparison. However, it is preferable to use more than 
one polymorphic position for the pattern to improve the accuracy of the prediction. If the 
5 cardiovascular status is predisposition to a cardiovascular syndrome, at least two, and 
preferably at least three, polymorphic positions are used to make the pattern. In addition, 
other polymorphisms in genes encoding angiotensin converting enzyme (ACE), 
angiotensinogen (AGT), type 1 angiotensin II receptor (ATI), type 2 angiotensin II 
receptor, renin, aldosterone synthase, endothelin, receptor or 0-adrenergic receptors 1 and 
10 2 can be used to establish a polymorphic pattern for the individual. 

For any meaningful prediction, the polymorphic pattern of the individual is 
identical to the polymorphic pattern of individuals who exhibit particular status markers, 
cardiovascular syndromes, and/or particular patterns of response to therapeutic 
interventions. 

15 In one embodiment, the method involves comparing an individual's test 

polymorphic pattern with reference polymorphic patterns of individuals who have been 
shown to respond positively or negatively to a particular therapeutic regimen. Therapeutic 
regimen as used herein refers to treatments aimed at the elimination or amelioration of 
symptoms and events associated cardiovascular disease. Such treatments include without 

2 0 limitation one or more of alteration in diet, lifestyle, and exercise regimen; invasive and 
noninvasive surgical techniques such as atherectomy, angioplasty, and coronary bypass 
surgery; and pharmaceutical interventions, such as administration of ACE inhibitors, 
angiotensin II receptor antagonists, diuretics, alpha-adrenoreceptor antagonists, cardiac 
glycosides, phosphodiesterase inhibitors, beta-adrenoreceptor antagonists, calcium channel 

2 5 blockers, HMG-CoA reductase inhibitors, imidazoline receptor blockers, endothelin 

receptor blockers, and organic nitrites. Interventions with pharmaceutical agents not yet 
known whose activity correlates with particular polymorphic patterns associated with 
cardiovascular disease are also encompassed. The present inventors have discovered that 
particular polymorphic patterns correlate with an individual's responsivity to ACE 

3 0 inhibitors (see, e.g., Example 3 below). It is contemplated, for example, that patients who 

are candidates for a particular therapeutic regimen will be screened for polymorphic 
patterns that correlate with responsivity to that particular regimen. 
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In another embodiment, the method involves comparing an individual's 
polymorphic pattern with polymorphic patterns of individuals who exhibit or have 
exhibited one or more markers of cardiovascular disease, such as, e.g., high blood 
pressure, abnormal electrocardiographic profile, myocardial infarction, unstable angina, 
5 stroke, or atherosclerosis (see, e.g. , Example 2 below) and drawing analogous conclusions 
as to the individual's responsivity to therapy, predisposition to developing a syndrome, 
etc., as detailed above. 

Identi fication nf Polymo rphic Patterns 
1 o in practicing the methods of the invention, an individual's polymorphic 

pattern can be established e.g. by obtaining DNA from the individual and determining the 
sequence at a predetermined polymorphic position or positions in a gene, or more than one 
gene. 

The DNA may be obtained from any cell source. Non-limiting examples of 
1 5 cell sources available in clinical practice include without limitation blood cells, buccal 
cells, cervicovaginal cells, epithelial cells from urine, fetal cells, or any cells present in 
tissue obtained by biopsy. Cells may also be obtained from body fluids, including without 
limitation blood, saliva, sweat, urine, cerebrospinal fluid, feces, and tissue exudates at the 
site of infection or inflammation. DNA is extracted from the cell source or body fluid 
2 0 using any of the numerous methods that are standard in the art. It will be understood that 
the particular method used to extract DNA will depend on the nature of the source. 

Determination of the sequence of the extracted DNA at polymorphic 
positions is achieved by any means known in the art, including but not limited to direct 
sequencing, hybridization with allele-specific oligonucleotides, allele-specific PCR, 
2 5 ligase-PCR, HOT cleavage, denaturing gradient gel electrophoresis (DGGE), and single- 
stranded conformational polymorphism (SSCP). Direct sequencing may be accomplished 
by any method, including without limitation chemical sequencing, using the Maxam- 
Gilbert method; by enzymatic sequencing, using the Sanger method; mass spectrometry 
sequencing; and sequencing using a chip-based technology. See, e.g., Little et aL, Genet. 
3 0 Anal., 1 996, 6:151. Preferably, DNA from a subject is first subjected to amplification by 
polymerase chain reaction (PCR) using specific amplification primers. 

In an alternate embodiment, biopsy tissue is obtained from a subject. 
Antibodies that are capable of distinguishing between different polymorphic forms of 
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ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 
receptor, or p-adrenoceptor are then applied to samples of the tissue to determine the 
presence or absence of a polymorphic form specified by the antibody. The antibodies may 
be polyclonal or monoclonal, preferably monoclonal. Measurement of specific antibody 
binding to cells may be accomplished by any known method, e.g., quantitative flow 
cytometry, or enzyme-linked or fluorescence-linked immunoassay. The presence or 
absence of a particular polymorphism or polymorphic pattern, and its allelic distribution 
(i.e., homozygosity vs. heterozygosity) is determined by comparing the values obtained 
from a patient with norms established from populations of patients having known 
polymorphic patterns. 

In another alternate embodiment, RNA is isolated from biopsy tissue using 
standard methods well known to those of ordinary skill in the art such as guanidium 
thiocyanate-phenol-chloroform extraction (Chomocyznski et al, Anal. Biochem., 1987, 
162:156). The isolated RNA is then subjected to coupled reverse transcription and 
amplification by polymerase chain reaction (RT-PCR), using specific oligonucleotide 
primers that are specific for a selected polymorphism. Conditions for primer annealing are 
chosen to ensure specific reverse transcription and amplification; thus, the appearance of 
an amplification product is diagnostic of the presence of a particular polymorphism. In 
another embodiment, RNA is reverse-transcribed and amplified, after which the amplified 
sequences are identified by, e.g., direct sequencing. In still another embodiment, cDNA 
obtained from the RNA can be cloned and sequenced to identify a polymorphism. 



Establishing Re ference Polymorphism Patterns 
In practicing the present invention, the distribution of polymorphic patterns 
in a large number of individuals exhibiting particular cardiovascular status is determined 
by any of the methods described above, and compared with the distribution of 
polymorphic patterns in patients that have been matched for age, ethnic origin, and/or any 
other statistically or medically relevant parameters, who exhibit quantitatively or 
qualitatively different cardiovascular status. Correlations are achieved using any method 
known in the art, including nominal logistic regression or standard least squares regression 
analysis. In this manner, it is possible to establish statistically significant correlations 
between particular polymorphic patterns and particular cardiovascular statuses. It is 
further possible to establish statistically significant correlations between particular 
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polymorphic patterns and changes in cardiovascular status such as, would result, e.g., from 
particular treatment regimens. Thus, it is possible to correlate polymorphic patterns with 
responsivity to particular treatments. 

A statistically significant correlation preferably has a "p" value of less than 
or equal to 0.05. Any standard statistical method can be used to calculate these values, 
such as the normal Student's T Test, or Fischer's Exact Test. 

The identity and number of polymorphisms to be included in a reference 
pattern depends not only on the prevalence of a polymorphism and its predictive value for 
the particular use, but also on the value of the use and its requirement for accuracy of 
prediction. The greater the predictive value of a polymorphism, the lower the need for 
inclusion of more than one polymorphism in the reference pattern. However, if a 
polymorphism is very rare, then its absence from an individual's pattern might provide no 
indication as to whether the individual has a particular status. Under these circumstances, 
it might be advisable to select instead two or more polymorphisms which are more 
prevalent. Even if none of them has a high predictive value on its own, the presence of 
both (or all three) of them might be sufficiently predictive for the particular purpose. 

If for example the use for a reference pattern is prediction of response to a 
drug, and among the afflicted population only a 30% response to the drug is observed, the 
reference pattern need only permit selection of a population that improves the response 
rate by 10% to provide a significant improvement in the state of the art. On the other 
hand, if the use for the reference pattern is selection of subjects for a particular clinical 
study, the pattern should be as selective as possible and should therefore include a 
plurality of polymorphisms that together provide a high predictive accuracy for the 
intended response. 

In establishing reference polymorphism patterns, it is desirable to use a 
defined population. For example, tissue libraries collected and maintained by state or 
national departments of health can provide a valuable resource, since genotypes 
determined from these samples can be matched with medical history, and particularly 
cardiovascular status, of the individual. Such tissue libraries are found, for example, in 
Sweden, Iceland, Norway, and Finland. As can be readily understood by one of ordinary 
skill in the art, specific polymorphisms may be associated with a closely linked 
population. However, other polymorphisms in the same gene may correlate with 
cardiovascular status of other genetically related populations. Thus, in addition to the 
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specific polymorphisms provided in the instant application, the invention identifies genes 
in which any polymorphisms can be used to establish reference and test polymorphism 
patterns for evaluating cardiovascular status of individuals in the population. 

In a specific embodiment, DNA samples can be obtained form a well 

5 defined population, such as 277 Caucasian males born in Uppsala, Sweden between 1920 
and 1924. In a specific embodiment, such individuals are selected for the test population 
based on their medical history, i.e., they were either (i) healthy, with no signs of 
cardiovascular disease (100); or (ii) had suffered one of acute myocardial infarction (68), 
silent myocardial infarction (34), stroke (18), stroke and acute myocardial infarction (19), 

L 0 or high blood pressure at age 50 (39). DNA samples are obtained from each individual. 

In a specific embodiment, DNA sequence analysis can be carried out by: 
(i) amplifying short fragments of each of the genes using polymerase chain reaction (PCR) 
and (ii) sequencing the amplified fragments. The sequences obtained from each individual 
can then be compared with the first known sequences, e.g., as set forth in Table 1, to 

15 identify polymorphic positions. 



Cnm parinp Test Patterns to Re ference Patterns 
As noted above, the test pattern from an individual can be compared to a 
reference pattern established for a predetermined cardiovascular status. Identity between 
2 0 the test pattern and the reference pattern means that the tested individual has a probability 
of having the same cardiovascular status as that represented by the reference pattern. As 
discussed above, this probability depends on the prevalence of the polymorphism and the 
statistical significance of its correlation with a cardiovascular status. 

2 5 Polymorphic Positions 

Polymorphic positions in the genes encoding ACE, ATI, AGT, renin, 
aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or P- 
adrenoceptor which are encompassed by the invention can be identified by determining the 
DNA sequence of all or part of the gene in a multiplicity of individuals in a population. 

3 0 DNA sequence determination may be achieved using any conventional method, including, 

e.g., chemical or enzymatic sequencing. 
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The polymorphic positions of genes for use in the invention include without 
limitation those listed below, whose numbering corresponds to the GenBank sequences 
listed in Table 1 . 

(i) ACE: positions in the regulatory region (designated ACR) 
5 numbered 5106, 5349, and 5496; positions in the coding region (designated ACE) 

numbered 375, 582, 731, 1060. 1215, 2193, 2328, 2741, 3132, 3387, 3503, and 3906; and 
position 1451 as numbered in GenBank entry X62855. 

(ii) AGT: positions in the regulatory region (designated AGR) 
numbered 395, 412, 432, 449, 692, 839, 1007, 1072, 1204, and 1218; positions in the 

10 coding region (designated AGT) numbered 273, 620, 803, 912, 997, 1116, and I174;and 

position 49 as numbered in GenBank entry M24688. 

(iii) AT 1 : positions in the regulatory region (designated ATR) numbered 
1427, 1756, 1853, 2046, 2354, 2355, and 2415; and positions in the coding region 
(designated ATI) numbered 449, 678, 1 167, and 1271. 

15 (iv) Renin: A mutant renin gene in familial elevation of prorenin, a 

point mutation in the last exon of the gene (exon 10), has been identified (Villard et al, J. 
Biol. Chem., 1994, 269:30307-12). A cytosine to thymine transition creates a premature 
stop codon at position 387 resulting in a truncated form of renin with 20 amino acids 

deleted from the carboxyl terminus. 
2 0 ( V ) Aldosterone synthase: A position in the promoter region of 

aldosterone synthase, position -344 (with the initiation codon starting at 1) has been 
reported by Cambien et al. at the International Meeting on Hypertension held in 

Amsterdam in June 1998. 

P-adrenergic receptor- 1 , positions in the regulatory region (designated BP1) 
2 5 numbered 2238, 2440, 2493, 2502, 2577, 2585, 2693, 2724, and 2757; and positions in the 
coding region (designated BR1) numbered 231, 758, 1037, 1251, 1403, and 1528. 

p-adrenergic receptor-2, positions in the regulatory region (designated B2P) 
numbered 932, 934, 987, 1006, 1120, 1221, 1541, and 1568; and positions in the coding 
region (designated B2R) numbered 839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, 
30 and 2070. 

Endothelin receptor type A coding region (designated ET A ) numbered 969, 
1005, 1146, and 2485. 
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In preferred embodiments, the base at each of the above polymorphic 
positions is one of: 

(i) ACE Regulatory Region: 5106C, 5106T, 5349A, 5349T, 5496T, 

and 5496C; 

5 (ii) ACE Coding Region: 375 A, 375C, 582C, 582T, 731 A, 73 1G, 

1060G, 1060A, 1215C, 1215T, 2193G,2193A, 2328 A, 2328G, 274 1G, 274 IT, 3132C, 
3132T, 3387T, 3387C, 3503G, 3503C, 3906G, and 3906A; and a deletion of nucleotides 
1451-1783 as numbered in GenBank entry X62855; 

(iii) AGT Regulatory Region: 395T, 395A, 412C, 412T, 432G, 432A, 
10 449T, 449C, 692C, 692T, 839G, 839A, 1007G, 1007 A, 1072G, 1072A, 1204C, 1204A, 

1218A, 1218G; 

(iv) AGT Coding Region: 273C, 273T, 620C, 620T, 803T, 803C, 912C, 
912T, 997G, 997C, 1 1 16G, 1 1 16A, 1 174C, and 1 174A; and A or G at position 49 in 
GenBank entry M24688; 

15 (v) ATI Regulatory Region: 1427 A, 1427T, 1756T, 1756A, 1853T, 

1853G, 2046T, 2046C, 2354A, 2354C, 2355G, 2355C, 2415A and 2415G; and 

(vi) ATI Coding Region: 449G, 449C, 678T, 678C, 1 167 A, 1 167G, 
1271A, and 1271C. 

(vii) P-adrenergic receptor- 1 regulatory region: 2238 G, 2238 A, 2577 C, 
2 0 2257 T, 2757 A, and 2757 G. 

(viii) P-adrenergic receptor- 1 coding region: 231 A, 231 G, 758 C, 758 T, 
1251 C, 1251 G, 1403 A, 1403 G, 1528 C, and 1528 A. 

(ix) P-adrenergic receptor-2 regulatory region: 934 A, 934 G, 987 C, 
987 G, 1006 A, 1006 G, 1120 C, 1120 G, 1221 C, 1221 T, 1541 C, 1541 T, 1568 C, and 

2 5 1568 T. 

(x) P-adrenergic receptor-2 coding region: 839 A, 839 G, 872 C, 872 G, 
1045A, 1045 G, 1284 C, 1284 T, 1316 A, 1316 C, 1846 C, 1846 G, 2032 A, 2032 G, 2068 
no insert, 2068 G, 2068 C, 2070 no insert, and 2070 C. 

(xi) Endothelin receptor type A: 969 C, 969 T, 1005 A, 1005 G, 1 146 

3 0 A, 1 146 G, 2485 T, and 2485 C. 

An individual may be homozygous or heterozygous for a particular 
polymorphic position (see, e.g., Table 6 below). 
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Non-limiting examples of polymorphic patterns comprising one or more 
polymorphism in ACE, AGT, and/or ATI genes according to the invention include the 
following, which were correlated with an increased incidence of clinical signs of 
cardiovascular disease: 

5 ACR 5349 A/T, AGR 1 2 1 8 A; ACR 5496 C, AGR 1 204 A/C; ACR 5496 

C/T, AGR 1218 A, AGT 620 C/T; ACE 2193 A, AGR 1204 C, ACE 2328 G; ACE 2193 
A, AGR 1204 A/C; ACE 3387 T, AGR 1218 A; ACE 3387 T, AGT 620 C/T; AGR 1204 
A/C, ATI 678 C/T; AGR 1204 A/C, ATI 1271 A/C; ACE 1215 C, AGR 1204 A/C; AGR 
1204 A/C, ATI 1 167 A, ACE 3906 A/G; AGR 1204 A, AGT 620 C, ATI 1271 A, ATI 
10 1 167 A, AGR 395 A/T; AGR 1204 A/C, AGT 620 C/T, ATI 1271 A/C, ATI 1 167 A, 

AGR 395 T; AGR 1204 A/C, AGT 620 C/T, ATI 1271 A/C, ATI 1 167 A/G, AGR 395 T; 
AGR 1204 A, ATI 678 C, ATI 1 167 A, AGR 395 A/T; AGR 1204 A/C, ATI 678 C/T, 
ATI 1 167 A, AGR 395 T; AGT 620 C/T, ATI 1271 A/C, ATI 1 167 A, AGR 395 T; AGT 
620 C/T, ATI 1271 A/C, ATI 1 167 A/G, AGR 395 T; AGT 620 C, ATI 1271 A, ATI 
15 1 167 A, AGR 395 A/T; AGT 620 C, ATI 678 A, ATI 1 167 A, AGR 395 A/T; AGT 620 
C/T, ATI 678 C/T; ATI 1 167 A, AGR 395 T; ACE 2193 A, AGR 1218 A, AGT 803 A; 
ACE 2193 A, AGT 620 C/T; ACE 2328 G, AGT 620 C/T; ACE 3387 T, AGR 1204 A/C; 
ACE 2193 A, ACE 2328 G, AGR 1204 C; ACE 2193 A/G, AGR 1072 G/G, ATI 1 167 
A/A. Additional polymorphism patterns are shown in the Tables in Examples 4 and 5, 

2 0 below. 

Polymorphism Patterns Correlated With ACE Inhi bitor Responsiveness 

The following table lists a set of polymorphism patterns that have been 
found to correlate with responsiveness to ACE inhibitor treatment: 
2 5 Response to ACE-Inhibitor Treatment 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE:2193: 


A/G 


AGR: 1072: 


G/G 


AT1:1167: 


A/A 


ACE:2193: 


A/G 


AGR: 1072: 


G/G 


ACE: 1060: 


G/G 


ATR:2354: 


A/A 


AT1:678: 


C/T 


ATI: 1167: 


A/A 


ACR:5496: 


C/T 


AGR: 1204: 


A/A 


AGR: 839: 


G/G 


ACR:5496: 


C/T 


AGR: 1204: 


A/A 


AGT:620: 


C/C 


ACR:5496: 


C/T 


ACE: 1060: 


G/G 


AGR:449: 


C/T 
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Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACR:5496: 


C/T 


ACE:1215: 


C/T 


AGR:1204: 


A/A 


ACR5496* 


C/T 


ACE:3906: 


A/G 


AGR:1218: 


A/G 


ACR'5496* 


C/T 


AGR:449: 


C/T 


AGR:839: 


G/G 


ACR:5496: 


C/T 


AGR449: 


C/T 


AT1:1271: 


A/A 


ACR:5496: 


C/T 


AGR-.449: 


C/T 


AGR:1072: 


G/G 


ACR:5496: 


C/T 


AGR:1072: 


G/G 


AGR:1204: 


A/A 


ACE- 1060- 


G/G 


AGR:449: 


C/T 


AGT:620: 


C/C 


ACE- 1060* 


G/G 


AGR:1007: 


G/G 


AT1:678: 


C/T 


ACE- 1215- 


C/T 


ACE:3906: 


A/G 


AGR1218: 


A/G 


AGR-.449: 


C/T 


AGR:1204: 


A/A 


AGT:620: 


C/C 


AGR:449: 


C/T 


AGR:1204: 


A/A 


AGT:1116: 


G/G 


AGR:449: 


C/T 


AGR839: 


G/G 


AGT:620: 


C/C 


AGR:449: 


C/T 


AGT:620: 


C/C 


AGT:1116: 


G/G 


AGR:449: 


C/T 


AGT:1116: 


G/G 


AT1:1271: 


A/A 


AGR:449: 


C/T 


AGR:1072: 


G/G 


AGT:620: 


C/C 


AGR:1007: 


G/C 


AGR1072: 


G/G 


AT1:678: 


C/T 


AGR:1072: 


G/G 


AGR1204: 


A/A 


AGT:1116: 


G/G 


ACR:5496: 


C/T 


AGR:1204: 


A/A 


ATI: 1167: 


A/A 


AGR:1007: 


G/G 


AGR:1204: 


A/A 


AGT:1116: 


G/G 



Polymorphism Patterns of Correlated With ACE Inhibitor Non-responsiveness 
The following table lists a set of polymorphism patterns that have been 
found to correlate with non-responsiveness to ACE inhibitor treatment: 

Non-Response to ACE-Inhibitor Treatment 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE: 1060: 


G/G 


AGR:1204: 


A/C 


AGT:620: 1 


C/C 


ACE:1215: 


C/T 


ACE-.2193: 


A/G 


AGR.1204 


A/C 


ACE.2193: 


A/G 


ATR:2046: 


C/T 


AT1:678: 


C/C 
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Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotvpe 3 


ACE:2193: 


A/G 


AT1:678: 


C/C 


AT1:1271: 


A/C 



Polymorphism Patterns Correlated With Predisposition to MI 
The following table lists a set of polymorphism patterns that have been 
found to correlate with predisposition to myocardial infarction: 

Predisposition to MI 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE:2193: 


A/A 


AGR:1204: 


A/C 






AGR:449: 


T/T 


AGR:1204: 


A/C 


AT1T271: 


A/C 


ACE:2193: 


A/G 


AGR:620: 


C/T 


AGR:1116 


G/G 


ACE:2193: 


A/G 


AGR:449: 


T/T 


AT1:1271: 


A/C 


ACR:5349: 


A/A . 


AGR:449: 


T/T 


AT1:1271: 


A/C 


ACE:2193: 


A/A 


AGR:620: 


C/T 


AT1:1271: 


A/A 



10 



15 



Polymorphism Patterns Correlated With Predisposition to Stroke 

The following table lists a set of polymorphism patterns that have been 
found to correlate with predisposition to stroke: 

Predisposition to Stroke 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE:2193: 


A/A 


AGR:395: 


A/T 






AGR:1007: 


A/G 


AGR:1072: 


G/G 


AT1:1167: 


A/A 


AGR:395: 


A/T 


AGR:1072: 


G/G 


AGRT218: 


A/G 



20 



Polymorphism Patterns in B-Adrenergic Receptor Genes 
2 5 The following table lists a set of polymorphism patterns that have been 

found in fi-adrenergic receptor genes: 

Positions carrying genetic variation in the Beta adrenergic receptors 1 and 

2. 
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B1P 


B1R 


B2R 




2238 C/A 


231 A/G 


839 A/G 


932 


2440 


758 C/T 
1037 


872 C/G 


934 A/G 


2493 


1251 C/G 


1045 A/G 


70 / V/U 

1006 A/G 


2502 


1403 A/G 


1284 C/T 


1120 C/G 


2577 C/T 


1528 C/A 


1316 A/C 


1221 C/T 


2585 




1846 C/G 


1541 C/T 


2693 




1891 


1568 C/T 


2724 




2032 A/G 




2757 A/G 




2068 no insert, G/G, 
C/G 

2070 no insert /C 





10 



15 



20 



25 



B1P: Beta adrenergic receptor 1, regulatory promoter region. 

B1R: Beta adrenergic receptor 1, coding region. 

B2P: Beta adrenergic receptor 2, regulatory promoter region. 

B2R: Beta adrenergic receptor 2, coding region. 

A number of different polymorphisms have been identified in the type 2 P- 
adrenoceptor. All of these differed from the wild type sequence by a single base change. 
Four of the polymorphisms alter the amino acid sequence of the receptor protein (Hall, 
Thorax, 1996, 51:351-353). The amino acid sequence modifications are described in 

greater detail below: 

Argl6-Gly: The Glyl6 variant undergoes an enhanced agonist-promoted 
down regulation as compared to wild type but the coupling to adenylyl cyclase and agonist 
binding are maintained (Liggett, Am. J. Respir. Crit. Care Med., 1997, 156:S 156-S162). 

Gln27-Glu: The Glu27 variant displays very little agonist-promoted 
downregulation and the coupling to adenylyl cyclase and agonist binding are maintained 
(id.). 

Val34-Met: Met34 is very rare. No altering of receptor function has been 

found (id.). 
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Thrl64-Ile: Uncommon (about 5%). The He 164 variant shows depressed 
coupling to adenylyl cyclase and decreased affinities for agonists with hydroxy! groups on 
their P-carbons, such as epinephrine, norepinephrine, and isoproterenol compared to wild 
type (id.). 

5 The polymorphism at nucleic acid 523 (CGG-AGG) might be linked with 

one of the other functional polymorphisms (id.). 

There are no differences in frequency of these polymorphisms between the 
normal group and those with asthma but they have been correlated to differences in 
response to treatment with agonists in asthma, e.g., the Glyl6 variant undergoes an 
1 0 enhanced agonist-promoted downregulation compared to wild type (id.). 



Polymorphism Patterns in Endothelin Receptor Type A Gene 
The following table lists a set of polymorphism patterns that have been 
found in the coding region of the endothelin receptor type A gene: 



Position 969 


Position 1005 


Position 1 146 


Position 2485 


C/C 


A/A 


A/A 


T/T 


C/T 


A/G 


A/G 


T/C ! 


T/T 


G/G 







20 Isolated Polymorphic Nucleic Acids, Vectors. Probes & Primers and Arrays 

Vectors for Expression of Polymorphic Variants 
The present invention provides isolated nucleic acids comprising the 
polymorphic positions described above for the human genes encoding ACE, ATI, AGT, 
renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, and p- 

2 5 adrenoceptor; vectors comprising the nucleic acids; and transformed host cells comprising 

the vectors. The invention also provides probes which are useful for detecting these 
polymorphisms. 

The nucleic acids encoding a gene comprising a polymorphism that is 
useful for determining cardiovascular status of an individual is particularly valuable for 

3 0 screening, whether by direct screening of the nucleic acid with the polymorphism, or by 

screening the polypeptide expressed by that nucleic acid. 
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In practicing the present invention, many conventional techniques in 
molecular biology, microbiology, and recombinant DNA, are used. Such techniques are 
well known and are explained fully in, for example, Sambrook et ai, Molecular Cloning: 
A Laboratory Manual, Second Edition, 1989 (Cold Spring Harbor Laboratory Press, Cold 
5 Spring Harbor, New York); DNA Cloning: A Practical Approach, Volumes I and II, 1985 
(D.N. Glover ed.); Oligonucleotide Synthesis, 1984, (M.L. Gait ed.); Nucleic Acid 
Hybridization, 1985, (Hames and Higgins); Ausubel et ai, Current Protocols in 
Molecular Biology, 1997 (John Wiley and Sons); and Methods in Enzymology Vol. 154 
and Vol. 155 (Wu and Grossman, and Wu, eds., respectively). 
! o Insertion of nucleic acids (typically DNAs) comprising the sequences of the 

present invention into a vector is easily accomplished when the termini of both the DNAs 
and the vector comprise compatible restriction sites. If this cannot be done, it may be 
necessary to modify the termini of the DNAs and/or vector by digesting back single- 
stranded DNA overhangs generated by restriction endonuclease cleavage to produce blunt 
1 5 ends, or to achieve the same result by filling in the single-stranded termini with an 
appropriate DNA polymerase. 

Alternatively, any site desired may be produced, e.g., by ligating nucleotide 
sequences (linkers) onto the termini. Such linkers may comprise specific oligonucleotide 
sequences that define desired restriction sites. Restriction sites can also be generated by 
2 0 the use of the polymerase chain reaction (PCR). See, e.g. , Saiki et al, Science, 1 988, 

239:48. The cleaved vector and the DNA fragments may also be modified if required by 

homopolymeric tailing. 

The nucleic acids may be isolated directly from cells or may be chemically 
synthesized using known methods. Alternatively, the polymerase chain reaction (PCR) 

2 5 method can be used to produce the nucleic acids of the invention, using either chemically 

synthesized strands or genomic material as templates. Primers used for PCR can be 
synthesized using the sequence information provided herein and can further be designed to 
introduce appropriate new restriction sites, if desirable, to facilitate incorporation into a 
given vector for recombinant expression. 

3 o The nucleic acids of the present invention may be flanked by native gene 

sequences, or may be associated with heterologous sequences, including promoters, 
enhancers, response elements, signal sequences, polyadenylation sequences, introns, 5'- 
and 3'- noncoding regions, and the like. 
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The invention also provides nucleic acid vectors comprising the disclosed 
genes or derivatives or fragments thereof. A large number of vectors, including plasmid 
and fungal vectors, have been described for replication and/or expression in a variety of 
eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple 
5 cloning or protein expression. Non-limiting examples of suitable vectors include without 
limitation pUC plasmids, pET plasmids (Novagen, Inc., Madison, WI), or pRSET or pREP 
(Invitrogen, San Diego, CA), and many appropriate host cells, using methods disclosed or 
cited herein or otherwise known to those skilled in the relevant art. The particular choice 
of vector/host is not critical to the practice of the invention. 
1 o Suitable host cells may be transformed/transfected/infected as appropriate 

by any suitable method including electroporation, CaCl 2 mediated DNA uptake, calcium 
phosphate precipitation, fungal or viral infection, lipofection, microinjection, 
microprojectile, or other established methods. Appropriate host cells included bacteria, 
archebacteria, fungi, especially yeast, and plant and animal cells, especially mammalian 
1 5 cells. A large number of transcription initiation and termination regulatory regions have 
been isolated and shown to be effective in the transcription and translation of heterologous 
proteins in the various hosts. Examples of these regions, methods of isolation, manner of 
manipulation, etc. are known in the art. Under appropriate expression conditions, host 
cells can be used as a source of recombinantly produced ACE-, AGT-, or ATI -derived 
20 peptides and polypeptides. 

Nucleic acids encoding ACE-, AGT-, or ATI-derived gene sequences may 
also be introduced into cells by recombination events. For example, such a sequence can 
be introduced into a cell and thereby effect homologous recombination at the site of an 
endogenous gene or a sequence with substantial identity to the gene. Other 

2 5 recombination-based methods such as nonhomologous recombinations or deletion of 

endogenous genes by homologous recombination may also be used. 

Oli gonucleotides 
The nucleic acids of the present invention find use as probes for the 

3 0 detection of genetic polymorphisms, as primers for the expression of polymorphisms, or in 

molecular library arrays for high throughput screening. 

Probes in accordance with the present invention comprise without 
limitation isolated nucleic acids of about 10 - 100 bp, preferably 15-75 bp and most 
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preferably 17-25 bp in length, which hybridize at high stringency to one or more of the 
gene-derived polymorphic sequences disclosed herein or to a sequence immediately 
adjacent to a polymorphic position. Furthermore, in some embodiments a full-length gene 
sequence may be used as a probe. In one series of embodiments, the probes span the 
polymorphic positions in the genes disclosed above. In another series of embodiments, the 
probes correspond to sequences immediately adjacent to the polymorphic positions. 

The oligonucleotide nucleic acids may also be modified by many means 
known in the art. Non-limiting examples of such modifications include methylation, 
"caps", substitution of one or more of the naturally occurring nucleotides with an analog, 
intcrnucleotide modifications such as, for example, those with uncharged linkages (e.g., 
methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with 
charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.). Nucleic acids may 
contain one or more additional covalently linked moieties, such as, for example, proteins 
(e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., 
acridinc, psoralen, etc.), chelators (e.g., metals, radioactive metals, iron, oxidative metals, 
etc.), and alkylators. PNAs are also included. The nucleic acid may be derivatized by 
formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidate linkage. 
Furthermore, the nucleic acid sequences of the present invention may also be modified 
with a label capable of providing a detectable signal, either directly or indirectly. 
Exemplary labels include radioisotopes, fluorescent molecules, biotin, and the like. 

PCR amplification of gene segments that contain a polymorphism provides 
a powerful tool for detecting the polymorphism. The oligonucleotides of the invention can 
also be used as PCR primers to amplify gene segments containing a polymorphism of 
interest. The amplified segment can be evaluated for the presence or absence of a 
polymorphism by restriction endonuclease activity, SSCP, or by direct sequencing. In 
another embodiment, the primer is specific for a polymorphic sequence on the gene. If the 
polymorphism is present, the primer can hybridize and DN A will be produced by PCR. 
However, if the polymorphism is absent, the primer will not hybridize, and no DNA will 
be produced. Thus, PCR can be used to directly evaluate whether a polymorphism is 
present or absent. 

Molecular library arrays of oligonucleotides (including oligonucleotides 
with modifications as described above) are another powerful tool for rapidly assessing 
whether one or more polymorphisms are present in a ACE, ATI, AGT, renin, aldosterone 
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synthase, type-2 angiotensin II receptor, endothelin receptor, or ^-adrenoceptor gene, 
preferably in combination with other genes. Molecular library arrays are disclosed in US 
Patents No. 5,677,195, No. 5,599,695, No. 5,545,531, and No. 5,510,270. 

5 Polymorphic Polypeptides and Polymorphism-Specific Antibodies 

The present invention encompasses isolated peptides and polypeptides 
encoded by all or a portion of a gene encoding a polypeptide selected from the group 
consisting of ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, and p-adrenoceptor, comprising polymorphic positions disclosed 

10 above. In one preferred embodiment, the peptides and polypeptides are useful screening 
targets to identify cardiovascular drugs. In another preferred embodiment, the peptides 
and polypeptides are capable of eliciting antibodies in a suitable host animal that react 
specifically with a polypeptide comprising the polymorphic position and distinguish it 
from other polypeptides having a different amino acid sequence at that position. 

15 Polypeptides according to the invention are preferably at least five or more 

residues in length, preferably at least fifteen residues. Methods for obtaining these 
polypeptides are described below. Many conventional techniques in protein biochemistry 
and immunology are used. Such techniques are well known and are explained in 
Immunochemical Methods in Cell and Molecular Biology, 1987 (Mayer and Waler, eds; 

2 0 Academic Press, London); Scopes, Protein Purification: Principles and Practice, Second 
Edition 1987 (Springer- Verlag, N.Y.) and Handbook of Experimental Immunology, 
Volumes I-IV 1986 (Weir and Blackwell eds.). 

Nucleic acids comprising protein-coding sequences can be used to direct 
the recombinant expression of ACE, ATI, AGT, renin, aldosterone synthase, type-2 

2 5 angiotensin II receptor, endothelin receptor, or P-adrenoceptor-derived polypeptides in 

intact cells or in cell-free translation systems. The known genetic code, tailored if desired 
for more efficient expression in a given host organism, can be used to synthesize 
oligonucleotides encoding the desired amino acid sequences. The polypeptides may be 
isolated from human cells, or from heterologous organisms or cells (including, but not 

3 0 limited to, bacteria, fungi, insect, plant, and mammalian cells) into which an appropriate 

protein-coding sequence has been introduced and expressed. Furthermore, the 
polypeptides may be part of recombinant fusion proteins. 
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Peptides and polypeptides may be chemically synthesized by commercially 
available automated procedures, including, without limitation, exclusive solid phase 
synthesis, partial solid phase methods, fragment condensation or classical solution 
synthesis. The polypeptides are preferably prepared by solid phase peptide synthesis as 
5 described by Merrifield, J. Am. Chem. Soc, 1963, 85:2149. 

Methods for polypeptide purification are well-known in the art, including, 
without limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, 
reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and 
countercurrent distribution. For some purposes, it is preferable to produce the polypeptide 
10 in a recombinant system in which the protein contains an additional sequence tag that 
facilitates purification, such as. but not limited to, a polyhistidine sequence. The 
polypeptide can then be purified from a crude lysate of the host cell by chromatography on 
an appropriate solid-phase matrix. Alternatively, antibodies produced against ACE, ATI, 
AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or 
15 p-adrenoceptor or against peptides derived therefrom, can be used as purification reagents. 
Other purification methods are possible. 

The present invention also encompasses derivatives and homologues of the 
polypeptides. For some purposes, nucleic acid sequences encoding the peptides may be 
altered by substitutions, additions, or deletions that provide for functionally equivalent 
2 0 molecules, i.e., function-conservative variants. For example, one or more amino acid 
residues within the sequence can be substituted by another amino acid of similar 
properties, such as, for example, positively charged amino acids (arginine, lysine, and 
histidine); negatively charged amino acids (aspartate and glutamate); polar neutral amino 
acids; and non-polar amino acids. 

2 5 The isolated polypeptides may be modified by, for example, 

phosphorylation, sulfation, acylation, or other protein modifications. They may also be 
modified with a label capable of providing a detectable signal, either directly or indirectly, 
including, but not limited to, radioisotopes and fluorescent compounds. 

The present invention also encompasses antibodies that specifically 

3 0 recognize the polymorphic positions of the invention and distinguish a peptide or 

polypeptide containing a particular polymorphism from one that contains a different 
sequence at that position. Such polymorphic position-specific antibodies according to the 
present invention include polyclonal and monoclonal antibodies. The antibodies may be 
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elicited in an animal host by immunization with ACE, ATI, ACT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, or P-adrenoceptor-derived 
immunogenic components or may be formed by in vitro immunization of immune cells. 
The immunogenic components used to elicit the antibodies may be isolated from human 
5 cells or produced in recombinant systems. The antibodies may also be produced in 
recombinant systems programmed with appropriate antibody-encoding DNA. 
Alternatively, the antibodies may be constructed by biochemical reconstitution of purified 
heavy and light chains. The antibodies include hybrid antibodies (i.e., containing two sets 
of heavy chain/light chain combinations, each of which recognizes a different antigen), 
10 chimeric antibodies (i.e., in which either the heavy chains, light chains, or both, are fusion 
proteins), and univalent antibodies (i.e., comprised of a heavy chain/light chain complex 
bound to the constant region of a second heavy chain). Also included are Fab fragments, 
including Fab' and F(ab) 2 fragments of antibodies. Methods for the production of all of the 
above types of antibodies and derivatives are well-known in the art and are discussed in 
1 5 more detail below. For example, techniques for producing and processing polyclonal 
antisera are disclosed in Mayer and Walker, Immunochemical Methods in Cell and 
Molecular Biology, 1987 (Academic Press, London). The general methodology for 
making monoclonal antibodies by hybridomas is well known. Immortal antibody- 
producing cell lines can be created by cell fusion, and also by other techniques such as 
2 0 direct transformation of B lymphocytes with oncogenic DNA, or transfection with 

Epstein-Barr virus. See, e.g., Schreier et al., Hybridoma Techniques, 1980; U.S. Patent 
Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632; and 
4,493,890. Panels of monoclonal antibodies produced against ACE, AGT, or ATI-derived 
epitopes can be screened for various properties; i.e. for isotype, epitope affinity, etc. 
25 The antibodies of this invention can be purified by standard methods, 

including but not limited to preparative disc-gel electrophoresis, isoelectric focusing, 
HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, 
and countercurrent distribution. Purification methods for antibodies are disclosed, e.g., in 
The Art of Antibody Purification, 1989 (Amicon Division, W.R. Grace & Co.) General 
3 0 protein purification methods are described in Protein Purification: Principles and 
Practice, R.K. Scopes, Ed., 1987 (Springer- Verlag, New York, NY). 

Methods for determining the immunogenic capability of the disclosed 
sequences and the characteristics of the resulting sequence-specific antibodies and immune 
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cells are well-known in the art. For example, antibodies elicited in response to a peptide 
comprising a particular polymorphic sequence can be tested for their ability to specifically 
recognize that polymorphic sequence, i.e., to bind differentially to a peptide or polypeptide 
comprising the polymorphic sequence and thus distinguish it from a similar peptide or 
polypeptide containing a different sequence at the same position. 

Diagnostic Methods and Kits 

The present invention provides kits for the determination of the sequence at 
a polymorphic position or positions within the ACE, ATI, ACT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, or ^-adrenoceptor gene in 
an individual, in combination with determination of the sequence at polymorphism 
positions of other genes. The kits comprise a means for determining the sequence at the 
polymorphic positions, and may optionally include data for analysis of polymorphic 
patterns. The means for sequence determination may comprise suitable nucleic acid-based 
j and immunological reagents (see below). Preferably, the kits also comprise suitable 

buffers, control reagents where appropriate, and directions for determining the sequence at 
a polymorphic position. The kits may also comprise data for correlation of particular 
polymorphic patterns with desirable treatment regimens or other indicators. 

3 Nucleic Acid-Based Diagnostic Methods and Kits 

The invention provides nucleic acid-based methods for detecting 
polymorphic patterns in a biological sample. The sequence at particular polymorphic 
positions in the genes is determined using any suitable means known in the art, including 
without limitation hybridization with polymorphism-specific probes and direct 

5 sequencing. 

The present invention also provides kits suitable for nucleic acid-based 
diagnostic applications. In one embodiment, diagnostic kits include the following 
components: 

(i) Probe DNA: The probe DNA may be pre-labelled; alternatively, 
0 the probe DNA may be unlabelled and the ingredients for labelling may be included in the 
kit in separate containers; and 
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(ii) Hybridization reagents: The kit may also contain other suitably 
packaged reagents and materials needed for the particular hybridization protocol, including 
solid-phase matrices, if applicable, and standards. 

In another embodiment, diagnostic kits include: 

(i) Sequence determination primers: Sequencing primers may be 
pre-labelled or may contain an affinity purification or attachment moiety; and 

(ii) Sequence determination reagents: The kit may also contain 
other suitably packaged reagents and materials needed for the particular sequencing 
protocol. In one preferred embodiment, the kit comprises a panel of sequencing primers, 
whose sequences correspond to sequences adjacent to the polymorphic positions. 

Antibody-Based Diagnostic Methods and Kits 
The invention also provides antibody-based methods for detecting 
polymorphic patterns in a biological sample. The methods comprise the steps of: (i) 
contacting a sample with one or more antibody preparations, wherein each of the antibody 
preparations is specific for a particular polymorphic form of the gene under conditions in 
which a stable antigen-antibody complex can form between the antibody and antigenic 
components in the sample; and (ii) detecting any antigen-antibody complex formed in step 
(i) using any suitable means known in the art, wherein the detection of a complex indicates 
the presence of the particular polymorphic form in the sample. 

Typically, immunoassays use either a labelled antibody or a labelled 
antigenic component (e.g., that competes with the antigen in the sample for binding to the 
antibody). Suitable labels include without limitation enzyme-based, fluorescent, 
chemiluminescent, radioactive, or dye molecules. Assays that amplify the signals from the 
probe are also known, such as, for example, those that utilize biotin and avidin, and 
enzyme-labelled immunoassays, such as ELISA assays. 

The present invention also provides kits suitable for antibody-based 
diagnostic applications. Diagnostic kits typically include one or more of the following 
components: 

(i) Polymorphism-specific antibodies: The antibodies may be pre-labelled; 
alternatively, the antibody may be unlabelled and the ingredients for labelling may be 
included in the kit in separate containers, or a secondary, labelled antibody is provided; 
and 
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(ii) Reaction components: The kit may also contain other suitably 
packaged reagents and materials needed for the particular immunoassay protocol, 
including solid-phase matrices, if applicable, and standards. 

The kits referred to above may include instructions for conducting the test. 
5 Furthermore, in preferred embodiments, the diagnostic kits are adaptable to high- 
throughput and/or automated operation. 

Drug Targets and Scr eening Methods 

According to the present invention, nucleotide sequences derived from the 
10 gene encoding a polymorphic form of ACE, ATI, ACT, renin, aldosterone synthase, type- 
2 angiotensin II receptor, endothelin receptor, or P-adrenoceptor, and peptide sequences 
derived from that polymorphic for, are useful targets to identify cardiovascular drugs, i.e., 
compounds that are effective in treating one or more clinical symptoms of cardiovascular 
disease. Drug targets include without limitation (i) isolated nucleic acids derived from the 
1 5 gene encoding ACE, AT 1 , AGT, renin, aldosterone synthase, type-2 angiotensin II 

receptor, endothelin receptor, or P-adrenoceptor and (ii) isolated peptides and polypeptides 
derived from ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, or p-adrenoceptor polypeptides, each of which comprises one or more 
polymorphic positions. 

20 

In vitro screening methods 
In one series of embodiments, an isolated nucleic acid comprising one or 
more polymorphic positions is tested in vitro for its ability to bind test compounds in a 
sequence-specific manner. The methods comprise: 

2 5 (i) providing a first nucleic acid containing a particular sequence at a 

polymorphic position and a second nucleic acid whose sequence is identical to that of the 
first nucleic acid except for a different sequence at the same polymorphic position; 

(ii) contacting the nucleic acids with a multiplicity of test compounds under 

conditions appropriate for binding; and 

3 o (iii) identifying those compounds that bind selectively to either the first or 

second nucleic acid sequence. 

Selective binding as used herein refers to any measurable difference in any 
parameter of binding, such as, e.g., binding affinity, binding capacity, etc. 
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In another series of embodiments, an isolated peptide or polypeptide 
comprising one or more polymorphic positions is tested in vitro for its ability to bind test 
compounds in a sequence-specific manner. The screening methods involve: 

(i) providing a first peptide or polypeptide containing a particular sequence 
at a polymorphic position and a second peptide or polypeptide whose sequence is identical 
to the first peptide or polypeptide except for a different sequence at the same polymorphic 
position; 

(ii) contacting the polypeptides with a multiplicity of test compounds under 
conditions appropriate for binding; and 

(iii) identifying those compounds that bind selectively to one of the nucleic 
acid sequences. 

In preferred embodiments, high-throughput screening protocols are used to 
survey a large number of test compounds for their ability to bind the genes or peptides 
disclosed above in a sequence-specific manner. 

Test compounds are screened from large libraries of synthetic or natural 
compounds. Numerous means are currently used for random and directed synthesis of 
saccharide, peptide, and nucleic acid based compounds. Synthetic compound libraries are 
commercially available from Maybridge Chemical Co. (Trevillet, Cornwall, UK), 
Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH), and Microsource (New 
Milford, CT). A rare chemical library is available from Aldrich (Milwaukee, WI). 
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and 
animal extracts are available from e.g. Pan Laboratories (Bothell, WA) or MycoSearch 
(NC), or are readily producible. Additionally, natural and synthetically produced libraries 
and compounds are readily modified through conventional chemical, physical, and 
biochemical means. 

In vivo screening methods 
Intact cells or whole animals expressing polymorphic variants of a gene 
encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, or P-adrenoceptor can be used in screening methods to identify 
candidate cardiovascular drugs. 
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In one series of embodiments, a permanent cell line is established from an 
individual exhibiting a particular polymorphic pattern. Alternatively, cells (including 
without limitation mammalian, insect, yeast, or bacterial cells) are programmed to express 
a gene comprising one or more polymorphic sequences by introduction of appropriate 
DNA. Identification of candidate compounds can be achieved using any suitable assay, 
including without limitation (i) assays that measure selective binding of test compounds to 
particular polymorphic variants of the gene; (ii) assays that measure the ability of a test 
compound to modify (i.e., inhibit or enhance) a measurable activity or function of the 
gene; and (iii) assays that measure the ability of a compound to modify (i.e., inhibit or 
enhance) the transcriptional activity of sequences derived from the promoter (i.e., 
regulatory) regions the gene. 

In another series of embodiments, transgenic animals are created in which 

(i) a human ACE, ATI, AGT t renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, or ^-adrenoceptor gene having different sequences at particular 
polymorphic positions are stably inserted into the genome of the transgenic animal; and/or 

(ii) the endogenous genes are inactivated and replaced with human genes having different 
sequences at particular polymorphic positions. See, e.g., Coffman, Semin. Nephrol., 1997, 
17:404; Esther et al, Lab. Invest., 1996, 74:953; Murakami et al, Blood Press. Suppl., 
1996, 2:36. Such animals can be treated with candidate compounds and monitored for one 
or more clinical markers of cardiovascular status. 

Furthermore, populations that are not amenable to an established treatment 
for a cardiovascular disease or disorder can be selected for testing of alternative 
treatments. Moreover, treatments that are not as effective in the general population, but 
that are highly effective in the selected population, may be identified that otherwise would 
be overlooked. This is an especially powerful advantage of the present invention, since it 
eliminates some of the randomness associated with clinical trials. 

The following are intended as non-limiting examples of the invention. 

Example 1 : Methods for Identification of Polymorphic Positions in Human Genes 
Encoding ACE, AGT. and ATI 

The following studies were performed to identify polymorphic residues 
within the genes encoding human ACE, AGT, and ATI. 
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DNA samples were obtained from 277 individuals. The individuals were 
Caucasian males born in Uppsala, Sweden between 1920 and 1924. Individuals were 
selected for the test population based on their medical history, i.e., they were either (i) 
healthy, with no signs of cardiovascular disease (100); or (ii) had suffered one of acute 
5 myocardial infarction (68), silent myocardial infarction (34), stroke (18), stroke and acute 
myocardial infarction (19), or high blood pressure at age 50 (39). DNA samples were 
obtained from each individual. 

DNA sequence analysis was carried out by: (i) amplifying short fragments 
of each of the ACE, AGT, and ATI genes using polymerase chain reaction (PCR) and (ii) 
1 0 sequencing the amplified fragments. The sequences obtained from each individual were 
then compared with known ACE, AGT, and ATI genomic sequences (see Table 1). 

(i) Amplification: PCR reactions employed the primers shown in Table 2 

below. 
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Where indicated, the primers were modified in one of the following ways: 
(i) a biotin molecule was conjugated to the 5' terminus of the indicated sequence (B); (ii) a 
sequence of nucleotides derived from Ml 3, 5'-CAGGAAACAGCTATGACT-3' (SEQ ID 
NO: 120), was added at the 5 1 terminus of the indicated sequence (MT); or (iii) the 
sequence S'-AGTCACGACGTTGTAAAACGACGGCCAGT-S' (SEQ ID NO: 121) was 
added at the 5' terminus of the indicated sequence (T = Tail). Nucleotides were numbered 
according to the GenBank sequences listed in Table 1 where indicated. When the 
sequences involved were not publicly available, the numbering was as in the following 
examples: The designation "i-4: 1-200" indicates that the primer sequence is located 
within the sequence extending 200 bp upstream of, and including, the nucleotide 
immediately upstream of the first coding nucleotide of exon 4. Similarly, the designation 
"i+4: 1-200" indicates that the primer sequence is located within the sequence extending 
from the nucleotide that is located immediately downstream of the last coding nucleotide 
of exon 4 downstream for 200 bp. In each case, the specific location of the primer 
sequence is indicated in Table 2 in the column marked "Nucleotides". 

The reaction components used for PCR are described in Table 3, and the 
reaction conditions for PCR are described Table 4, below. 
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All of the PCR products (except fragments ACEDI, ATI-spec. 1 and ATI- 
spec. 2) were subjected to solid phase sequencing according to the protocol commercially 
available from Pharmacia Biotech. The sequencing reactions are performed with a 
sequencing primer having a complementary sequence to the "Tail" sequence previously 
described in Table 2. The nucleotide sequence of the sequencing primer was 5'- 
CGACGTTGTAA AACG ACGGCC AGT-3 ' (SEQ ID NO: 122), and the primer was 
fluorescently labeled with a Cy-5-molecule on the 5' -nucleotide. The positions carrying a 
genetic variation were identified by determination of the nucleotide sequence by the use of 
the ALFexpress™ system commercially available from Pharmacia Biotech. 

The detection of the fragment ACEDI was performed by analyzing the sizes 
of the amplified fragments by gel electrophoresis, where the presence of a shorter PCR 
product (192 base pairs) indicated the D-allele and a longer PCR product (479 base pairs) 
indicated the I-allele. The presence of both bands indicated a heterozygote for the two 
alleles. The detection of the allele-specific reaction of position ATI- 1271 was performed 
by separately running two parallel PCR reactions on the same sample and comparing the 
sizes of the amplified fragments. A PCR product of 501 base pairs should always be 
present as a control in both parallel runs, whereas the presence of a PCR product of 378 
base pairs in the reaction designated ATI -spec. 1 indicated the presence of an A in this 
position. The presence of a PCR product of 378 base pairs in the reaction designated ATI- 
spec. 2 indicated a C in this position. If the shorter PCR product was present in both 
reactions, the individual is a heterozygote for A and C. 

Results 

The analysis described above resulted in the identification of polymorphic 
positions within the regulatory and coding/intron segments of the human genes encoding 
ACE, AGT, and ATI . The polymorphic positions, the variant nucleotides found at each of 
the positions, and the PCR fragment in which the polymorphism was identified are shown 
in Table 6 below. Also shown are the frequencies of each genotype in a population of 90 
individuals, expressed as the percent of the study population having that genotype. 
Polymorphisms that resulted in alternate amino acids in ACE, AGT, or ATI are also 
indicated. As used herein below, the designations "AGR" , "ACR", and " ATR" refer to 
the regulatory regions of the human AGT, ACE, and ATI genes, respectively; and the 
designations "AGT", "ACE", and "ATI", refer to the coding regions of the AGT, ACE, 
and ATI genes. 
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Ser - Thr 


None 


Frequency 
(per cent) 
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AA-AG-GG 


TT 


CC-CT 


TT-TC-CC 


GG 


GG-GA 


AA-AT-TT 


TT-TA-AA 
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A subset of these polymorphic positions were further analyzed in an 
additional 187 individuals. Table 7 shows the polymorphic positions, the sequence at 
these positions, and the genotype frequencies for each position in a population of 277 as 
described in Example 1 above. 



Table 7 - Polymorphic Positions and Frequencies 



Gene 


Position 


Genetic variation 


Frequency (per cent) 


AGR 


395 


TT-TA-AA 


87-12-7 ! 


AGR 


432 


GG-GA-AA 


78-21-1 


AGR 


449 


TT-TC-CC 


94-5-1 


AGR 


692 


CC-CT-TT 


78-21-1 


AGR 


839 


GG-GA 


96-4 


AGR 


1007 


GG-GA-AA 


78-21-1 


AGR 


1072 


GG-GA 


76-24 


AGR 


1204 


CC-CA-AA 


3-27-70 


AGR 


1218 


AA-AG-GG 


16-50-34 


AGT 


620 


CC-CT-TT 


75-23-2 


AGT 


803 


TT-TC-CC 


34-50-16 


AGT 


1116 


GG-GA-AA 


83-15-2 


ACR 


5349 


AA-AT-TT 


37-44-19 


ACR 


5496 


TT-TC-CC 


38-43-19 


ACE 


1060 


GG-GA 


96-4 


ACE 


1215 


CC-CT-TT 


34-46-20 


ACE 


2193 


GG-GA-AA 


22-53-25 


ACE 


2328 


AA-AG-GG 


23-52-25 


ACE 


3387 


TT-TC-CC 


24-53-23 


ACE 


! 3906 


GG-GA-AA 


86-13-1 


ATR 


1427 


AA-AT-TT 


72-26-2 


ATR 


1756 


TT-TA-AA 


72-25-3 


ATR 


1853 


TT-TG-GG 


73-25-2 


ATR 


2046 


CC-CT-TT 


47-41-12 


ATR 


2354 


AA-AC-CC 


72-26-2 


ATR 


2355 


GG-GC-CC 


71-27-2 
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Gene 


Position 


Genetic variation 


Frequency (per cent) 


ATR 


2415 


AA-AG-GG 


73-25-2 


ATI 


678 


CC-CT-TT 


26-51-23 


ATI 


1167 


AA-AG 


88-12 


ATI 


1271 


AA-AC-CC 


55-36-9 



Example 2: Correlation of Polymorphism Patterns with Cardiovas cular Disease 

The polymorphic positions identified as in Example 1 were correlated with 
the following markers of cardiovascular status present in the study population: healthy 
(100 individuals evaluated); myocardial infarction (MI) (120 individuals); stroke (37 
individuals); and high blood pressure (BP) (39 individuals). Polymorphic patterns, i.e., 
combinations of sequences at particular polymorphic positions, that show a statistically 
significant correlation with one or more of these markers are shown below. 



Table 8 - Polymorphism Patterns 



ACR 5349 A/T, AGR 1218 A 



Healthy 



MI 



Stroke 



High BP 



Total (n) 



# of events 



17 



% within group | 3 



5.8 



8.1 



12.8 



ACR 5496 C, AGR 1204 A/C 



# of events 



Healthy 



MI 



Stroke 



High BP 



Total (n) 



13 



% within group 



5.8 



8.1 



5.1 





Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


4 


13 


I 


3 


21 


% within 
group 


4 


10.8 


2.7 


7.7 
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Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


0 


11 


3 


3 


16 




% within 
group 


0 


9.2 


8.1 


7.7 




ACF 


2191 A AGR 120 


4 A/C 














Healthy 


Ml 


Stroke 


High BP ! 


Total (n) 




# of events 


1 


1 


0 


1 


3 




% within 
group 


1 


0.8 


0 


2.6 




ACF 


3387 T, AGR 1218 A 














Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


2 


4 


1 


3 


10 




% within 
group 


2 


3.3 


2.7 


7.7 




ACF 


3387 T, AGT 620 C/T 








Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


1 


10 


3 


2 


15 




% within 
group 


1 


8.3 


8.1 


5.1 




AGF 


L 1204 A/C, ATI 678 C/T 






Healthy 


Ml 


Stroke 


High 


Total (n) 




# of events 


5 


23 


5 


6 


37 




% within 
group 


5 


19.2 


13.5 


15.4 







Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


3 


17 


3 


4 


26 


% within 
group 


3 


14.2 


8.1 


10.3 
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ACE 1215 C, AGR 1204 A/C 





Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


3 


13 


5 


6 


25 


% within 
group 


3 


10.8 


13.5 


15.4 




1204 A/C, ATI 1 167 A, ACE 3906 A/G 




Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


0 


5 


1 


0 


6 


% within 
1 group 


0 


4.2 


2.7 


0 







Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


1 


4 


5 


3 


11 


% within 
1 group 


1 


3.3 


13.5 


7.7 







Healthy 


MI 


Stroke i 


High BP 


Total (n) 


# of events 


3 


13 


3 


2 


20 


% within 
group 


3 


10.8 


8.1 


5.1 





AGR 1204 A/C, AGT 620 C/T, ATI 1271 A/C, ATI 1 167 A/G, AGR 395 T 
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High BP 


Total (n) 


# of events 
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1.7 
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2.6 
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Summary of the three previous polymorphic patterns (which involve the same polymorphic 





Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


4 


19 


8 


6 


34 


% within 
group 


4 


15.8 


21.6 


15.4 





AGR 1204 A, ATI 678 C, ATI 1 167 A, AGR 395 A/T 





Healthy 
October 12, 
1999 


MI 


Stroke 


High BP 


Total (n) 


# of events 


1 


2 


2 


1 


5 


% within 
group 


1 


1.7 


5.4 


2.6 





AGR 1204 A/C, ATI 678 C/T, ATI 1 167 A, AGR 395 T 





Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


3 


18 


5 


4 


28 


% within group 


3 


15.0 


13.5 


10.3 





Sum mary of the two previous polymorphic patterns: 



Healthy 



MI 
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Total (n) 



# of events 



20 



33 



% within group 4 



16.7 



18.9 



12.8 



AGT 620 C/T, ATI 1271 A/C, ATI 1 167 A, AGR 395 T 
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Stroke 
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# of events 
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5.1 
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# of events 
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% within group 
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1.7 
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2.6 
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Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 
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11 




% within group 
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3.3 


13.5 ! 


7.7 




Sumrra 


irv of the three previous polymorphs 


: patterns: 
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Ml 


Stroke 


High BP 


Total (n) 




# of events 


3 


14 


6 


6 


27 


% within group 
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11.7 


16.2 


15.4 




AGT € 


,20 C. ATI 678 A, ATI 1 167 A, AG 


R 395 A/T 








Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


1 


2 


2 


1 
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% within group 
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1.7 


5.4 


2.6 




AGT i 
Sumr 
ACE 


520 C/T. ATI 678 C/T; ATI 1 167 A 


, AGR 395 T 




Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


3 


15 


4 


4 




% within group 


3 


12.5 


10.8 


10.3 




narv of the two previous polymorphic patterns: . 




Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 
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17 
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29 


% within group 
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14.2 


16.2 


12.9 




2193 A, AGR 1218 


A, AGT 803 A 
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Ml 


Stroke 


High BP 


Total (n) 


# of events 


2 


5 
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11 


1 % within group 
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4.2 
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7.7 
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Total (n\ 


44 r\ f p\/pntc 

ft oi cvcnis> 


1 
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16 


/o wiiniii group 
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9.2 


8.1 


5.1 




2328 G, AGT 620 Ci 


T 






Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


1 


11 


3 


2 


16 


% within group 


1 


9.2 


8.1 


5.1 




3387 T, AGR 1204 A/C 




Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


0 


10 


3 


3 


15 


% within group 


0 


8.3 


8.1 


7.7 





10 



15 



Example 3: Correlation Between a Specific Polymorphism Pattern 
and Treatment Response 

The following study was undertaken to define polymorphic patterns in the 
human ACE, AGT, and/or ATI genes that predict the efficacy of treatments for 
cardiovascular disease. 

Two groups of hypertensive patients were studied, 41 in the first group and 
20 in the second group. The groups were analyzed independently and in combination. 

The patients in this population were each treated with one of the following 
five ACE inhibitors: Captopril, Trandolapril, Lisinopril, Fosinopril, or Enalapril. The 
effect of the drugs on mean arterial blood pressure was quantified. Mean arterial blood 
pressure was defined as 2/3 of the diastolic blood pressure + 1/3 of systolic blood pressure. 
The individuals were also categorized as "high responders," i.e., those exhibiting a 
decrease of more than 16 mm Hg during treatment with an ACE inhibitor drug, and "low 
responders," i.e., those not exhibiting a decrease of more than 16 mm Hg. 

One particular polymorphic pattern, ACE 2193 A/G, AGR 1072 G/G, ATI 
1 167 A/ A, which was present in 51% of the first study population, discriminated between 
high responders and low responders. In the second group of 20 patients, the pattern was 
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less prevalent (25%), but the correlation with lowered blood pressure remained statistically 
significant. Individuals having this polymorphic pattern (designated "1" below) 
experienced a larger decrease in blood pressure than those lacking this polymorphic 
pattern (designated "0" below). 



Table 9 - Response to Treatment Correlates with Polymorphic Pattern 



Polymorphic Pattern 


Observations 


Mean (mm Hg) 


S.D. 




Change in B.P. 




0 


36 


-11.4 


8.6 


1 


25 


-18.1 


9.7 



Furthermore, the distribution of high responders and low responders (as 
defined above) was as follows: 

Table 10 - Responder Status Correlates with Polymorphic Pattern 



Polymorphic Pattern 


Low responder % 


High responder % 


0 


80.1 


19.4 


1 


24.0 


76.0 



Taken together, the results from the two groups indicate that the presence 
of this polymorphic pattern correlates with an incremental decrease of 6.4-7.3 mm Hg 
relative to individuals not having this polymorphic pattern. 

The prevalence of this polymorphic pattern was 41% in this hypertensive 
population. This suggests that testing for this polymorphic pattern in hypertensive 
patients, followed by prescribing ACE inhibitors only to those patients having this 
polymorphic pattern, could increase the response rate from 43% (in a hypertensive 
population in general) to 76% in hypertensive population selected according to the 
methods of the invention. 

If even one polymorphism was absent from the pattern, the high resolution 
of the variable and therefore the predictive value would be lost. 
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Example 4: Correlation Between a Specific Polymorphism Pattern 
and Treatment Response or Predisposition to a 
Cardiovascular Syndrome 

In accordance with the methods disclosed in the above examples, additional 
correlations of polymorphic patterns with either responsiveness to ACE inhibitors, non- 
responsiveness to ACE inhibitors, predisposition to myocardial infarction, or 
predisposition to stroke were derived. These correlations are presented in the following 
table: 
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Prediction: 


Non-response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Non-response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Predisposition to MI 


Predisposition to Ml 


Predisposition to Ml 


Predisposition to Ml 


Predisposition to MI 


Predisposition to MI 


Predisposition to stroke 


ro 

CJ 

Cu 

© j 

c 

CJ 

(J 


C/C 


C/C 


G/G 


C/C 


G/G 


A/A 


C/C 


C/T 


G/G 


A/A 


A/C 


G/G 


i 


A/C 


G/G 


A/C 


A/C 


A/A 


i 


Position 3 


AT1:678: 


AGT:620: 


AGT:1116: 


AGT:620: 


AGT:1116: 


AT1:1271: 


AGT:620: 


ATI:678: 


AGT:III6: 


ATI: 1167: 


AT1M271: 


vo 

H 
O 
< 




AT1M27I: 


AGRMII6: 


AT1M271: 


AT1:127I: 


AT1:1271: 


■ 


Genotype2 


C/T 


A/A 


A/A 


G/G 


C/C 


G/G 


G/G 


G/G 


A/A 


A/A 


C/C 


A/A 


A/C 


A/C 


C/T 


T/T 


T/T 


C/T 


A/T 


Position 2 


ATR:2046: 


AGR:1204: 


AGR:1204: 


AGR:839: 


AGT:620: 


AGT:II16: 


AGR:1072: 


AGR:1072: 


AGR:I204: 


AGR:1204: 


AT1:678: 


AGR:1204: 


AGR:1204: 


AGR:1204: 


AGR:620: 


ANP:449: 


AGR:449: 


AGR:620: 


AGR:395: 


Genotypel 


A/G 


C/T 


C/T 


C/T 


C/T 


C/T 


C/T 


G/C 


G/G 


C/T 


A/G 


G/G 


A/A 


T/T 


A/G 


A/G 


A/A 


A/A 


A/A 


Position 1 


ACE:2193: 
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O 
< 


AGR:449: 


AGR:449: 


AGR:449: 


AGR:449: 


AGR:449: 


AGR:1007: 


AGR:1072: 


ACR:5496: 


ACE:2193: 


AGR:1007: 


ACE:2193: 


AGR:449: 


ACE:2I93: 


m 

ON 

<n 
ui 
CJ 

< 


ACR:5349: 


ACE:2I93: 


ACE:2193: 


Genetic 
Signature 


GSI9 


GS20 


GS2I 


GS22 


GS23 


GS24 


GS25 


GS26 


GS27 


GS30 


GS33 


GS36 


GS37 


GS38 


GS39 


GS40 


co 
O 
k. 


GS42 


GS43 
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Polymorphic patterns, i.e., combinations of sequences at particular 
polymorphic positions, that show a statistically significant correlation with myocardial 
infarction and stroke are shown below: 

Table 12 - Correlation with MI 



5 MI(n=120) 

Healthy (n=100) 





Genetic Signature 


Prevalence 


MI 


Healthy 


P-value 




ACE:2193 AA 


5.0% 


100% 


0% 


0.001 




ANP:1204 AC 










10 


AGR:449 TT 
AGRT204 AC 
ATI: 1271 AC 


8.2% 


89% 


11% 


0.002 




ACE:2193 AG 
AGT:620 CT 


4.5% 


100% j 


0% 


0.002 


15 


AGT:1167 GG 












ACE:2193 AG 
AGR:449 TT 


17.7% 


74% 


26% 


0.007 




ATI: 1271 AC 










20 


ACR:5349 AA 
AGR:449 TT 
AT1T271 AC 


14.5% 


75% 


25% 


0.013 




ACE:2193 AA 
AGT:620 CT 


3.6% 


100% 


0% 


0.009 




1 AT1:1271 AA 
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Table 13 - Correlation With Stroke 

Stroke (n=37) 
Healthy (n=100) 



Genetic Signature 


Prevalence 


Stroke 


Healthy 


P-value 


ACE:2193 AA 
AGR:395 AT 


2.9% 


100% 


0% 


0.005 


AGR:1007 AG 
AGR:1072 GG 
AT1:1167 AA 


9.5% 


85% 


15% 


0.000008 


AGR:395 AT 
AGR:1072 GG 
AGR:1218 AG 


4.4% 


83% 


17% 


0.006 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in the art from the 
foregoing description and the accompanying figures. Such modifications are intended to 
fall within the scope of the appended claims. 

It is further to be understood that all base sizes or amino acid sizes, and all 
molecular weight or molecular mass values, given for nucleic acids or polypeptides are 
approximate, and are provided for description. 

Various patents, patent applications, methodologies, and publications are 
cited herein, the disclosures of which are incorporated by reference in their entireties. 

Example 5: Correlation Between a Specific Polymorphism Pattern and High 
Blood Pressure Treatment with a ft-Blocker 

Using the techniques described in Examples 1 and 2, we investigated 
polymorphisms in the ADBP 1 and ADBR2 genes (regulator/promoter regions and coding 
regions). The source material was the same as described above. 

At the outset, we identified a set of key polymorphisms. Some of these are 
known; others are new. The are summarized in Table 14. 
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Table 14 -Polymorphic Positions in the Genes Encoding 
the Beta Adrenergic Receptors 1 and 2 





VJl-lIW»ljUCttllUII 
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Rip. 9218 

LJ 11. Z Z ~> O 


A / A 


a/p 


P/P 

VJ/ VJ 


J 


L) 1 1 . Z. J / I 


p/p 


P/T 
v^/ 1 


T/T i 




Rl P 97S7 
i_> 1 1 . z. 1 j i 


A /A 


a/p 


n/p 

VJ/ VJ 




R1R211 

D I IV. Z. J 1 


A / A 
A/ A 


a /p 

A/VJ 


P/P 




R1 R -7S8 

O i IV . /JO 


p/p 


P/T 

V_/ 1 


T/T 
l/l 




R1 R-12^1 

Lj 1 IV. IIJ 1 


p/p 


p/p 

v~,/vj 


P/P 

VJ/ VJ 


1 \J 


R1 R-1401 

L> 1 IV. I *+V/ J 


A / A 
A/A 


A /P 
A/VJ 


P/P 

VJ/ VJ | 




D J IV. I JZ.O 


A / A 

A/A 


A /P 
A/V_ 


P/P 
v_^/v^ 




RvP-Q'M 

DzJT .!7 j*+ 


A / A 

A/A 


a /p 

A/vj 


p;/p ! 
vj/ vj 






p/p 


P/P 


PJ/P 

VJ/ vJ 




r?p-ioo^ 

Di.1 . 1 \JUVJ 


A / A 
A/A 


A /P 
A/U 


Pi/P 
VJ/ VJ 


X J 


R2P* 1 1 ?0 

D Z I .11 Z.VJ 


p/p 


P/P 


O/P 

VJ/ VJ 




R2P-1221 


p/p 


P/T 


T/T 
i/i 




R2P1 S41 

DZl . I Jt I 


p/p 


P/T 


T/T 
l/l 




R2P-1 

DZ, I . 1 JUO 


P/P 


P/T 


T/T 
l/l 




R2RR1Q 


A / A 
A/A 


A /P 
A/vjr 


p;/p 

VJ/ VJ 


20 


R9R-87? 

DZ.IV. O / Z. 


p/p 


P/P 


Pi/P 

VJ/ VJ 




R2R-1 04^ 


A/A 


A/G 


G/G 




B2R-1284 


c/c 


C/T 


T/T 




B2R:1316 


A/A 


A/C 


C/C 




B2R:1846 


C/C 


C/G 


G/G 


25 


B2R:2032 


A/A 


A/G 


G/G 




B2R.2068 


no insertion 


insertion G/G 


insertion C/G 




B2R:2070 


no insertion 


insertion C/C 





Abbreviations used for the denotation of the genes: 
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BlP(ADBRl) 
BlR(ADBRl) 
B2P (ADBR2) 
B2R (ADBR2) 
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putative promoter region of the gene encoding the beta 
adrenergic receptor 1 

the protein coding region of the gene encoding the beta 
adrenergic receptor 1 

putative promoter region of the gene encoding the beta 
adrenergic receptor 2 

the protein coding region of the gene encoding the beta 
adrenergic receptor 2 



These positions, one or a combination of several, can be used to predict a 
good response to anti-hypertensive treatment with Beta-blockers. 

Indeed, we identified various polymorphisms/polymorphic patterns that can 
be used in the prediction of high response to anti-hypertensive treatment with 
beta-blockers. These are shown in Table 15. 



GS 


Positions - 
genotypes 


Prevalence 
(%) 


P-value 


Comments 


GSl 


B2R:1846-C/C 


7.7 


0.0049 


Patients show a 8.3 mmHg DBP 
reduction compared to the group lacking 
this genotype. 


GS2 


B2R:1045 - A/A 


6.6 


0.0105 


Patients will show a 8.1 mmHg DBP 
reduction compared to the group lacking 
this genotype. 


GS3 


B2R:1316 - A/A 


6.6 


0.0105 


Patients that will show a 8.1 mmHg 
DBP reduction compared to the group 
lacking this genotype. 



Not only positions in the beta adrenergic receptors, but also positions in the 
renin-angiotensin- aldosterone-system can be used to predict response to anti-hypertensive 
treatment with beta- blockers (see Table 16). The numbering give according to the 
numberings in GenBank as defined in Table 1, supra. 

Table 16 - RAAS Polymorphism Associated with p-Blocker Responsitivity 



GS 


Positions - Genotypes 


Prevalence (%) 


Comments 


GS105B 


ATR:2046 - C/C + 
AT1:678 -T/T 


10 


Patients will show more that 10-mmHg 
DBP (diastolic blood pressure reduction 
compared to the group lacking this 
genotype. 
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The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in the art from the 
foregoing description and the accompanying figures. Such modifications are intended to 
fall within the scope of the appended claims. 

It is further to be understood that all sizes and all molecular weight or 
molecular mass values are approximate, and are provided for description. 

Patents, patent applications, procedures, and publications cited throughout 
this application are incorporated herein by reference in their entireties. 
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1 A method for assessing cardiovascular status in an individual to be 

tested comprising comparing 

(a) a test polymorphic pattern comprising at least one 
polymorphic position within a gene encoding a polypeptide selected from 
the group consisting of renin, aldosterone synthase, type-2 angiotensin II 
receptor, endothelin receptor, and p-adrenoceptor of the individual, with 

(b) a reference polymorphic pattern derived from a population 
of individuals exhibiting a predetermined cardiovascular status; and 

concluding whether the individual possesses the cardiovascular status based on whether 
the test pattern matches the reference pattern. 

2. The method according to claim 1 , wherein the polymorphic pattern 
of (a) includes a polymorphic position within a second gene encoding a polypeptide 
selected from the group consisting of ACE, ATI, and AGT. 

3. The method according to claims 1 or 2, wherein the predetermined 
cardiovascular status is predisposition to a cardiovascular syndrome. 

4. The method according to claim 3 , wherein the cardiovascular 
syndrome is selected from the group consisting of myocardial infarction, unstable angina, 
hypertension, atherosclerosis, and stroke. 

5 . The method according to claim 3 , wherein the reference pattern 
comprises at least two polymorphisms. 

6. The method according to claim 5 , wherein the reference pattern 
comprises at least three polymorphisms. 

7. The method according to claims 1 or 2, wherein the predetermined 
cardiovascular status is a response to a cardiovascular treatment regimen. 
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1 8. The method according to claim 7, wherein the treatment regimen 

2 comprises administering a cardiovascular drug selected from the group consisting of an 

3 ACE inhibitor, an angiotensin II receptor antagonist, a diuretic, an alpha-adrenoreceptor 

4 antagonist, a cardiac glycoside, a phosphodiesterase inhibitor, a beta-adrenoreceptor 

5 antagonist, a calcium channel blocker, a HMG-CoA reductase inhibitor, an imidizoline 

6 receptor blocker, an endothelin receptor blocker, and an organic nitrite. 

1 9. The method according to claim 1 , wherein the polymorphic position 

2 of the gene is selected from the group consisting of a position in the renin coding region 

3 comprising a cytosine to thymine transition in exon 1 0 that creates a premature stop codon 

4 at position 387 resulting in a truncated form of renin with 20 amino acids deleted from the 

5 carboxyl terminus; a position in a promoter region of aldosterone synthase numbered -344 

6 (with the initiation codon starting at 1 ); positions in the regulatory region of P-adrenergic 

7 receptor-1 gene (designated B IP) numbered 2238, 2440, 2493, 2502, 2577, 2585, 2693, 

8 2724, and 2757; positions in the p-adrenergic receptor- 1 coding region (designated B 1 R) 

9 numbered 231, 758,1037, 1251, 1403, and 1528; positions in the P-adrenergic receptor-2 

10 gene regulatory region (designated B2P) numbered 932, 934, 987, 1006, 1 120, 1221, 

11 1 54 1 , and 1 568; positions in the p-adrenergic receptor-2 gene coding region (designated 

12 B2R) numbered 839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, and 2070; positions 

13 in the endothelin receptor type A coding region (designated ET A ) numbered 969, 1005, 

14 1 1 46, and 2485; and combinations of any of the foregoing. 

1 1 0. The method according to claim 2, wherein the polymorphic position 

2 of the second gene is selected from the group consisting of positions in the ACE 

3 regulatory region numbered 5 1 06, 5349, and 5496; positions in the ACE coding region 

4 numbered 375, 582, 731, 1060, 1215, 2193, 2328, 2741, 3132, 3387, 3503, and 3906; 

5 position 145 1 in the ACE gene as numbered in GenBank entry X62855; positions in the 

6 ACT regulatory region numbered 395, 412, 432, 449, 692, 839, 1007, 1072, 1204, and 

7 1218; positions in the AGT coding region numbered 273, 620, 803, 912, 997, 1116, and 

8 1 174; position 49 in the AGT gene as numbered in GenBank entry M24688; positions in 

9 the ATI regulatory region numbered 1427, 1756, 1853, 2046, 2354, 2355, and 2415; and 

10 positions in the ATI coding region numbered 449, 678, 1 167, and 1271. 
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1 11. An isolated nucleic acid encoding a polypeptide selected from the 

2 group consisting of renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 

3 receptor, and P-adrenoceptor in an individual, wherein the nucleic acid comprises a 

4 polymorphic position and wherein the polymorphic position is a position in a polymorphic 

5 pattern derived from a population of individuals exhibiting a predetermined cardiovascular 

6 status. 

1 12. A probe which hybridizes at high stringency to a polymorphic 

2 position as defined in claim 1 1 . 

1 1 3. A library of nucleic acids comprising a nucleic acid which 

2 comprises 

3 (a) a polymorphic position of a gene encoding a polypeptide 

4 selected from the group consisting of renin, aldosterone synthase, type-2 

5 angiotensin II receptor, endothelin receptor, and P-adrenoceptor, wherein 

6 the polymorphic position is a position in a polymorphic pattern derived 

7 from a population of individuals exhibiting a predetermined cardiovascular 

8 status, and 

9 (b) nucleic acids comprising other polymorphic positions of 

10 human genes that are indicative of cardiovascular status of an individual, 

1 1 wherein the human genes are selected from the group consisting of ACE, 

12 ATI , AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 

13 endothelin receptor, and P-adrenoceptor. 

1 14. The library of claim 13 wherein the nucleic acids are 

2 oligonucleotides. 

1 15. A kit for assessing cardiovascular status comprising 

2 (a) sequence determination primers and 

3 (b) sequence determination reagents, 

4 wherein the primers are selected from the group consisting of primers that hybridize to or 

5 immediately adjacent to 
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1 (i) a polymorphic position in a gene gene encoding a 

2 polypeptide selected from the group consisting of renin, aldosterone 

3 synthase, type-2 angiotensin II receptor, endothelin receptor, and P- 

4 adrenoceptor, and 

5 (ii) a second polymorphic position in a gene encoding a 

6 polypeptide selected from the group consisting of ACE, ATI , AGT, 

7 renin, aldosterone synthase, type-2 angiotensin II receptor, 

8 endothelin receptor, and P-adrenoceptor. 

1 16. The kit of claim 15, wherein the second polymorphic position is 

2 selected from the group consisting of positions in the ACE regulatory region numbered 

3 5106, 5349, and 5496; positions in the ACE coding region numbered 375, 582, 731, 1060, 

4 1215, 2193, 2328, 2741, 3132, 3387, 3503, and 3906; position 1451 in the ACE gene as 

5 numbered in GenBank entry X62855; positions in the AGT regulatory region numbered 

6 395, 412, 432, 449, 692, 839, 1007, 1072, 1204, and 1218; positions in the AGT coding 

7 region numbered 273, 620, 803, 912, 997, 1 1 16, and 1 174; position 49 in the AGT gene as 

8 numbered in GenBank entry M24688; positions in the ATI regulatory region numbered 

9 1427, 1756, 1853, 2046, 2354, 2355, and 2415; positions in the ATI coding region 

10 numbered 449, 678, 1 167, and 1271 ; a position in the renin coding region comprising a 

1 1 cytosine to thymine transition in exon 1 0 that creates a premature stop codon at position 

12 387 resulting in a truncated form of renin with 20 amino acids deleted from the carboxyl 

13 terminus; a position in a promoter region of aldosterone synthase numbered -344 (with the 

14 initiation codon starting at 1); positions in the regulatory region of p-adrenergic receptor- 1 

15 gene (designated B1P) numbered 2238, 2440, 2493, 2502, 2577, 2585, 2693, 2724, and 

16 2757; positions in the P-adrenergic receptor- 1 gene coding region (designated B1R) 

17 numbered 231, 758, 1037, 1251, 1403, and 1528; positions in the P-adrenergic receptor-2 

18 regulatory region (designated B2P) numbered 932, 934, 987, 1006, 1 120, 1221, 1541, and 

19 1568; positions in the p-adrenergic receptor-2 gene coding region (designated B2R) 

20 numbered 839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, and 2070; positions in the 

21 endothelin receptor type A coding region (designated ET A ) numbered 969, 1005, 1 146, 

22 and 2485; and combinations of any of the foregoing. 
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SEQUENCE LISTING 

<110> Eurona Medical AB 

Norberg, Leif Torbjorn 
Andersson, Maria Kristina 
Lindstrom, Per Harry Rutger 

<12 0> GENES FOR ASSESSING CARDIOVASCULAR 

STATUS AND COMPOSITONS FOR USE THEREOF 



<130> 1248/1E973-US1 

<140> TBA 
<141> 

<160> 140 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 1 
tgcgtgcttc agaagtcc 

<210> 2 

<211> 19 

<212> DNA 

<213> "Artificial Sequence" 

<400> 2 
ccagggaggt gaagaaatc 

<210> 3 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 

<400> 3 
agccaggcag taatgacct 

<210> 4 

<211> 18 

<212> DNA 

<213> "Artificial Sequence" 

<400> 4 
gcccactgtt cccttatg 

<210> 5 
<211> 18 
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<212> DNA 

<213> "Artificial Sequence" 
<400> 5 

tgccctgact gacagagc xo 

<210> 6 
<211> 16 
<212> DNA 

<213> "Artificial Sequence" 

<400> 6 
gccccggtgt gcctgt 

<210> 7 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 7 
tgcctggata tgtgttgc 

<210> 8 

<211> 18 

<212> DNA 

<213> "Artificial Sequence" 

<400> 8 
tgcctggata tgtgttgc 

<210> 9 
<211> 16 
<212> DNA 

<213> "Artificial Sequence" 

<400> 9 
gccctcgcct ctcact 

<210> 10 

<211> 18 

<212> DNA 

<213> "Artificial Sequence" 

<400> 10 
tcccctctcc ctgtacct 

<210> 11 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 11 
gtgctggggt agggtaga 

<210> 12 

.1. 
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<211> 16 
<212> DNA 

<213> "Artificial Sequence" 



<400> 12 
tccccctgac ctggct 



16 



<210> 13 
<211> 16 
<212> DNA 

<213> "Artificial Sequence" 



<400> 13 
ggggcaccgt gatgtt 



16 



<210> 14 
<211> 16 
<212> DNA 

<213> "Artificial Sequence" 

<400> 14 
ggggcaccgt gatgtt 

<210> 15 

<211> 17 

<212> DNA 

<213> "Artificial Sequence" 

<400> 15 
gccagagcct ttggttt 

<210> 16 

<211> 19 

<212> DNA 

<213> "Artificial Sequence" 

<400> 16 
tggaagagcc gacttacag 

<210> 17 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 17 
tcccagaggc aaagagg 

<210> 18 

<211> 19 

<212> DNA 

<213> "Artificial Sequence" 

<400> 18 
gtttctactg cggcttcat 



16 



17 



19 



17 



19 
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<210> 19 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 19 

gtttctactg cggcttcat 19 

<210> 20 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 20 

tcctggaaga gggagtttc 19 

<210> 21 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 21 

gcaggatgag agcaacaac 19 

<210> 22 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 
<400> 22 

ctggagacca ctcccatcct ttct 24 

<210> 23 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 23 

gatgtggcca tcacattcgt cagat 25 

<210> 24 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 24 

cttccgtggg actcatgt 18 

<210> 25 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 25 

tgcaccgtga ggctcta 17 
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<210> 26 

<211> 18 

<212> DNA 

<213> "Artificial Sequence" 

<400> 26 
gcccaatagg aggaagca 

<210> 27 
<211> 19 
<212> DNA 

<213> "Artificial Sequence 11 

<400> 27 
cccaccccat ctccaagaa 

<210> 28 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 

<400> 28 
tccctgatgg gctgctctc 

<210> 29 

<211> 19 

<212> DNA 

<213> "Artificial Sequence" 

<400> 29 
caaggccctc aaccaactc 

<210> 30 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 

<400> 30 
ctcccacaaa agctccagtg 

<210> 31 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 

<400> 31 
ggctcaaaat ggcaagtgtt 

<210> 32 

<211> 20 

<212> DNA 

<213> "Artificial Sequence" 

<400> 32 
gggccatgtc cttctgactc 
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<210> 33 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 33 

cagcctggag gggttaaga 19 

<210> 34 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 34 

cccttctgag cgagctgagt 20 

<210> 35 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 35 

ggccatgttg agctacttca a 21 

<210> 36 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 36 

cctccagcct tgggtcttaa 20 

<210> 37 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 37 

ttcccatccc agtctctggt 20 

<210> 38 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 38 

ggcagcctgg ttgatgagt 19 

<210> 39 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 39 

attccagctc tgaaattctc tga 23 

-6- 
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<210> 40 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 40 

gagcccct.cc agcacctc 18 

<210> 41 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 41 

acccgagcct gcccacc 17 

<210> 42 

<211> 18 

<212> DNA 

<213> "Artificial Sequence" 



<210> 43 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 43 

tcggctctgc cccttctc 18 

<210> 44 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 



<210> 45 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 45 

cggcggcagc agcaaca 17 

<210> 46 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 



<400> 42 
ggtcgggctg ggaagatc 



* 18 



<400> 44 
gccctttctc cagcttcctc t 



21 



<400> 46 
gagcccctcc agcacctc 



18 
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<210> 47 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 47 17 
acccgagcct gcccacc 

<210> 48 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 48 18 
ggtcgggctg ggaagatc 

<210> 49 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 49 18 
tcggctctgc cccttctc 

<210> 50 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 

<400> 50 21 
gccctttctc cagcttcctc t 

<210> 51 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 51 17 
cggcggcagc agcaaca 

<210> 52 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 

<400> 52 23 
atggcactta aaggtcagtt aat 

<210> 53 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 

<400> 53 19 
tacggaagcc caagaagtt 
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<210> 54 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 54 

ctccccaacg gctgtctt 18 

<210> 55 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 55 

agcagcaaca tccagttctg t 21 

<210> 56 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 56 

tcccacgctc tctggactt 19 

<210> 57 
<211> 26 
<212> DNA 

<213> "Artificial Sequence" 
<400> 57 

ctgatctcag ctacacatgg atacta 26 

<210> 58 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 58 

cctgtcttgg gtgactcttc 20 

<210> 59 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 59 

ttctgggcta aatggtgaca 20 

<210> 60 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 60 

cttgtcttcg gtgtcaagtt t 21 

-9- 
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<210> 61 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 61 lg 
gggagccttg gaccacac 

<210> 62 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 

<400> 62 2Q 
agcctgcatg aacctgccaa 

<210> 63 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 63 1? 
tggtgggcgt gttcaca 

<210> 64 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 64 1? 
gccagagcca gcagaga 

<210> 65 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 65 18 
ccacattcca ggggagac 

<210> 66 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 

<400> 66 2Q 
cctgtcttgg gtgactcttc 

<210> 67 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 67 18 
ccacattcca ggggagac 
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<210> 68 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 68 

gtcccttcag tgccctaata c 21 

<210> 69 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 69 

acagccagat tgaaagacac a 21 

<210> 70 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 70 

aaccctttta ctggtcatgt ga 22 

<210> 71 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 71 

cgctcatggg atgtgtgac 19 

<210> 72 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 72 

tgttttcccc agtgtctatt aga 23 

<210> 73 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 73 

gcagggtcga gttacacatt t 21 

<210> 74 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 74 

cctcaggctg tcacacacct a 21 
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<210> 75 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 

<400> 75 
cggcttacct tctgctgtag t 

<210> 76 

<211> 21 

<212> DNA 

<213> "Artificial Sequence" 

<400> 76 
ctccttgaac ctgcttgtgt t 

<210> 77 

<211> 22 

<212> DNA 

<213> "Artificial Sequence" 



21 



21 



<400> 77 
gcattgaaag atgtgctgtt ct 

<210> 78 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 

<400> 78 
taacgactac aaaagcaagt cttac 

<210> 79 

<211> 19 

<212> DNA 

<213> "Artificial Sequence" 

<400> 79 
agagggcagg ggagagtct 

<210> 80 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 80 
ggcagcaggg tcagaagt 

<210> 81 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 



22 



25 



19 



18 



<400> 81 
gctggagagg agggttacat 



20 
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<210> 82 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 82 

tgcaaacttc ggtaaatgtg t 21 

<210> 83 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 83 

cagaacaacg gcagcttct 19 

<210> 84 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 
<400> 84 

actggctgac ttatgctttt tact 24 

<210> 85 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 85 

gggttgaatt ttgggactca ta 22 

<210> 86 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 86 

gccagtttgc cagctataat 20 

<210> 87 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 87 

tgatgcctag ttgaatcaat aca 23 

<210> 88 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 88 

gaaggcttat gaaattcaga aga 23 
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<210> 89 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 89 

aaagtcggtt cagtccacat aa 22 

<210> 90 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 90 

aaacagcttg gtggtgatag tc 22 

<210> 91 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 91 

gcaggtgact ttggctacaa 20 

<210> 92 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 92 

aggaaacagg aaacccagta tat 23 

<210> 93 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 
<400> 93 

cctgtacgct agtgtgtttc tact 24 

<210> 94 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 94 

ctggattccc caccaaatat 20 

<210> 95 
<211> 23 
<212> DNA. 

<213> "Artificial Sequence" 
<400> 95 

tgctccttct ttcacaaaat tac 23 
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<210> 96 
<211> 26 
<212> DNA 

<213> "Artificial Sequence" 
<400> 96 

cttccgttat tatgtgtgat attagt 26 

<210> 97 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 97 

gcatgtacct aaaaagtcct gtc 23 

<210> 98 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 98 

attggcatat ccatcacctt aa 22 

<210> 99 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 99 

gatctcccaa ctcatgctat ga 22 

<210> 100 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 100 

attggattca atttgcctac at 22 

<210> 101 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 101 

tttggtaata cagttgtgga tcata 25 

<210> 102 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 102 

tgcaacttgg gtagcatgtc 20 
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<210> 103 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 103 

agtcgtcccg tgtcaactat c 21 

<210> 104 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 104 

cgttgtcttc cgttattatg tgt 23 

<210> 105 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 105 

ttattgcatg tacctaaaaa gtgta 25 

<210> 106 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 106 

gcattcatat aaagatcaaa tcagt 25 

<210> 107 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 107 

caccctgata acaaaaccag ata 23 

<210> 108 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 



21 



<400> 108 
ctttctggca tcaacctcac t 

<210> 109 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 109 

acttttaagg acgaattaga gaact 25 
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<210> 110 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 

<400> 110 22 
gtccaccctt gaatttcata ac 

<210> 111 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 111 18 
cccaacctcc tccctctc 

<210> 112 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 

<400> 112 19 
gctcgctctc cctcacgac 

<210> 113 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 113 17 
tccagccgct ccccatc 

<210> 114 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 

<400> 114 ig 
gctcgctctc cctcacgac 

<210> 115 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 115 17 
tccagccgct ccccatc 

<210> 116 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 

<400> 116 2 2 
gcccctcaga taatgtaagc tc 
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<210> 117 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 

<400> 117 
aaccggcacg aaaactttac t 

<210> 118 
<211> 22 
<212> DNA 

<213> "Artificial Sequence 

<400> 118 
gcacctcact accaaatgag ca 

<210> 119 

<211> 22 

<212> DNA 

<213> "Artificial Sequence" 

<400> 119 
gcacttcact accaaatgag cc 

<210> 120 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 

<400> 120 
caggaaacag ctatgact 

<210> 121 

<211> 29 

<212> DNA 

<213> "Artificial Sequence" 

<400> 121 
agtcacgacg ttgtaaaacg acggccagt 

<210> 122 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 

<400> 122 
cgacgttgta aaacgacggc cagt 

<210> 123 

<211> 1278 

<212> DNA 

<213> Homo sapien 

<400> 123 



ccagacaagtgatttttgag gagtccctat ctataggaac aaagtaatta aaaaaatgta 
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tttcagaatt tacaggccca tgtgagatat gattttttta aatgaagatt tagagtaatg 120 

ggtaaaaaag aggtatttgt gtgtttgttg attgttcagt cagtgaatgt acagcttctg 180 

cctcatatcc aggcaccatc tcttcctgct ctttgttgtt aaatgttcca ttcctgggta 240 

atttcatgtc tgccatcgtg gatatgccgt ggctccttga acctgcttgt gttgaagcag 300 

gatcttcctt cctgtccctt cagtgcccta ataccatgta tttaaggctg gacacatcac 360 

cactcccaac ctgcctcacc cactgcgtca cttgtgatca ctggcttctg gcgactctca 420 

ccaaggtctc tgtcatgccc tgttataacg actacaaaag caagtcttac ctataggaaa 480 

ataagaatta taaccctttt actggtcatg tgaaacttac catttgcaat ttgtacagca 540 

taaacacaga acagcacatc tttcaatgcc tgcatcctga aggcattttg tttgtgtctt 600 

tcaatctggc tgtgctattg ttggtgttta acagtctccc cagctacact ggaaacttcc 660 

agaaggcact tttcacttgc ctgtgtgttt tccccagtgt ctattagagg cctttgcaca 720 

gggtaggctc tttggagcag ctgaaggtca cacatcccat gagcgggcag cagggccaga 780 

agtggccccc gtgctgccta agcaagactc tcccctgccc tctgccctct gcacctccgg 840 

cctgcatgtc cctgtggcct cttgggggta catctcccgg ggctgggtca gaaggcctgg 900 

gtggttggcc tcaggctgtc acacacctag ggagatgctc ccgtttctgg gaaccttggc 960 

cccgactcct gcaaacttcg gtaaatgtgt aactcgaccc tgcaccggct cactctgttc 1020 

agcagtgaaa ctctgcatcg atcactaaga cttcctggaa gaggtcccag cgtgagtgtc 1080 

gcttctggca tctgtccttc tggccagcct gtggtctggc caagtgatgt aaccctcctc 1140 

tccagcctgt gcacaggcag cctgggaaca gctccatccc cacccctcag ctataaatag 1200 

ggcctcgtga cccggccagg ggaagaagct gccgttgttc tgggtactac agcagaaggt 1260 

aagccggggg ccccctca 1278 



<210> 124 

<211> 1347 

<212> DNA 

<213> Homo sapien 

<400> 124 
gtccatctcc tcatctcctc 
ctcatggcct cattttaact 
cattctaggt cctggggttt 
acataattca gctgatattg 
ctttgaatcc atttcttctt 
gacaggaata ttgagacaat 
ccacctttca ccctttgcaa 
agatggctcc tgccggtgtg 
gcctggctgc aggtgaccgg 
gtacctgtga gcagctggca 
ctgctccaat tcaggccaag 
tgctagtcgc tgcaaaactt 
tggccaactt cttgggcttc 
atggggccac cgtcctctcc 
gagccttgga ccacacagct 
agaactgcac ctcccggctg 
gcctgctagt ggcccagggc 
tgggcgtgtt cacagcccca 
tctatacccc tgtggtcctc 
agaagattga caggttcatg 
gagccagtgt ggacagcacc 
acctctctgc tggctctggc 
caggccttgg agatgggcag 

<210> 125 
<211> 377 
<212> DNA 
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ttctcataag gacacaggtc atattagatc agggcticacc 60 

taatcatctc tttaaagatc ctgtctccaa ataatggtca 120 

aggacttcaa cacgggcatt atggccgttg gggaggtagg 180 

tgcattttgc acttggatca tgtagatatt ctccacggag 240 

ttttttgtag acatgaatga tttattctgg gctaaatggt 300 

gaaagatctg gttagatggc acttaaaggt cagttaataa 360 

aatgatattt caggtatgcg gaagcgagca ccccagtctg 420 

agcctgaggg ccaccatcct ctgcctcctg gcctgggctg 480 

gtgtacatac accccttcca cctcgtcatc cacaatgaga 540 

aaggccaatg ccgggaagcc caaagacccc accttcatac 600 

acatcccctg tggatgaaaa ggccctacag gaccagctgg 660 

gacaccgaag acaagttgag ggccgcaatg gtcgggatgc 72 0 

cgtatatatg gcatgcacag tgagctatgg ggcgtggtcc 780 

ccaacggctg tctttggcac cctggcctct ctctatctgg 840 

gacaggctac aggcaatcct gggtgttcct tggaaggaca 900 

gatgcgcaca aggtcctgtc tgccctgcag gctgcacagg 960 

agggctgata gccaggccca gctgctgctg tccacggtgg 1020 

ggcctgcacc tgaagcagcc gtttgtgcag ggcctggctc 1080 

ccacgctctc tggacttcac agaactggac gttgctgctg 114 0 

caggctgtga caggatggaa gactggctgc tccctgatgg 1200 

ctggctttca acacctacgt ccacttccaa ggtaaggcaa 1260 

cctaggactt agtatccatg tgtagctgag atcagccagt 1320 

ggggcag 1347 
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<213> Homo sapien 



<400 
cctgcccctg 
aagggcttct 
tctgttccca 
ttctcggtga 
tatgcctctg 
atgaagaaac 
ctctcctgac 



> 125 
tcttgggtga 
ccctgctggc 
tgctctctgg 
ctgaagtgcc 
acctggacaa 
cgtctccccg 
ccaggct 



ctcttccccc 
cgagccccag 
catgggcacc 
cttcactgag 
ggtggagggt 
gtagagccct 



cctgtctcct 
gagttctggg 
ttccagcact 
agcgcctgcc 
cccactttcc 
cccggtctcc 



gtctgatttc 
tggacaacag 
ggagtgacat 
tgctgctgat 
agcaaaactc 
cctggaatgt 



agggaagatg 
cacctcagtg 
ccaggacaac 
ccagcctcac 
cctcaactgg 
gggagccaca 



<210> 126 

<211> 273 

<212> DNA 

<213> Homo sapien 



<400> 126 

cctctgggag agccctcact gtgtggcctg gagccttcct aactgtgcat catctcccca 
ggaccatcca cctgaccatg ccccaactgg tgctgcaagg atcttatgac ctgcaggacc 
tgctcgccca ggctgagctg cccgccattc tgcacaccga gctgaacctg caaaaattga 
gcaatgaccg catcagggtg ggggaggtat ttaccttcct tgcctacctg gtccattgca 
caggtgagca tgattaagga aaagagctat ggt 



<210> 127 

<211> 945 

<212> DNA 

<213> Homo sapien 



60 
120 
180 
240 
300 
360 
377 



60 
120 
180 
240 
273 



<400> 
agcccaccgc 
tcctgatcct 
cggatgagag 
tgaccctgaa 
tcctgggccg 
gcctggcaag 
ggacaaatca 
gagctggaaa 
gcctgcagcg 
gggaggcaag 
agcttgtttg 
aaaattaaag 
aacatgacct 
ttgaacaata 
ggttatttct 
ctttattaga 



127 
cggccctcta 
gatgtctggt 
agagcccaca 
ccgcccattc 
cgtggccaac 
gcctctgccc 
gcgatgtgtc 
gcagccgttt 
gcacaaatgc 
aaccagtgtt 
tgaaacaaaa 
tatacatttt 
ccgtgtagtg 
cgtgaaagat 
cccttgtgtt 
cgccaacaga 



gccctcacga 
cctttgcagg 
gagtctaccc 
ctgtttgctg 
ccgctgagca 
ctggcctttg 
acccccagtc 
ctccttggtc 
acctcccagt 
tagcgcggga 
aagtgttccc 
tgcattgcct 
tctgtaatac 
gcaagcacct 
agtaataaac 
tgtatacatt 



ccctgggtca 
tgctgaacag 
aacagcttaa 
cgtatgatca 
cagcatgagg 
aggcaaaggc 
tcccaccttt 
taagtgtgct 
ctgctgggtt 
ctactgttcc 
t tttcaagtt 
tcggtttgta 
cccagttttt 
gaatttctgt 
gncttgccac 
cagccagata 



cccatgcgcc 
cattttttct 
caagcctgag 
aagcgccact 
ccagggcccc 
cagcagcaga 
tcttctaatg 
gcatggagtg 
tattttagag 
aaaaagaatt 
gagaacaaaa 
tttagtgtct 
tccacagatg 
ttgaacgcgg 
actaagcctc 
gactg 



ctcagaatga 
gagcttgaag 
gtcttggagg 
gccctgcact 
agaacacagt 
taacaacccc 
agtcgacttt 
agcagtagaa 
aatgggggtg 
ccaaccgacc 
attgggtttt 
tgaatgtaag 
cttgtgattt 
aacaatagct 
caaatttact 



<210> 128 

<211> 1856 

<212> DNA 

<213> Homo sapien 

<400> 128 

gtgagagctc atgtgcaggc tgagtgagag gcgagggctg ggactggcat ggggcccggg 
ggtgctgggt gagagcacag agctgggctc ccctcgctct cggggtcagc gtgcccagga 
aatgcccttt cttgttttcc acgagggggg cttctctgcc cactgagagc cggcacctac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
945 



60 
120 
180 
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ttcataccat 

caccctctcc 

ttcatgggct 

ttcgcgaccc 

atgtgactga 

cctctgattc 

agctcttgac 

aaagagctca 

caccctttcc 

cagaaatggg 

ggggaaatgg 

tgtggacgtg 

ccctaacccc 

gaccagcctg 

cgtggcggta 

agcccagcca 

tggacgcggt 

cttgagcctg 

ggcaacagag 

tcaagcacgc 

ccggggactc 

cccgctgcct 

aggctggagt 

cattctcctg 

taattttttg 

ctcctgacct 

acagccactt 

cttacaagca 



gccccgatca 

tgtcctctct 

tggactccaa 

tgcccttgac 

taaggaaatt 

ctctcacttc 

tgtcctcttt 

cccccgacac 

agggttggag 

tctttctggc 

aaaacaaaca 

ggtacccatg 

agcactttgg 

gccaacatgg 

gatgcctgta 

gggcaataca 

ggtgcatgcc 

ggaggtcaag 

tgagaccctg 

ccctcacagg 

tgtaagccac 

atacagtcac 

gcagtggcgg 

cctcagcctc 

tatttttagt 

cgtgatccgc 

ttatgtggtt 

gaggtgagct 



gccgcccctc 

gcatgccgcc 

acggggggaa 

tccctcacct 

gagtcacaca 

cctcccctac 

tctccccact 

aaggacccgc 

aacttgaggc 

ccccagccca 

ggagaaacgg 

ccaagtgtga 

gaggccaagg 

cgaaacccca 

gccccagcta 

gcaagacccc 

ugtagtccta 

gccgcagtga 

tctcagaaag 

actgctgagg 

cgctggagac 

cttttttttt 

gatctcggct 

ccaagtagct 

agagacgggg 

ccgcctcggc 

tcgccaattt 

aagggctgga 



cctcagaacc 

cctcagagca 

ctcggggaca 

tctctgtctt 

gcccctgaaa 

cctgccactt 

ggctggtcct 

acacagatac 

ataaacattc 

gctcccacat 

ttttcccagg 

ggggaggctg 

tgagtagatc 

tctgtactaa 

cttgggaggc 

gtctctacaa 

gctgctaggg 

gccgagatgg 

aaaaaaaaaa 

ccctgcaggt 

cactcccatc 

tttttgagac 

cactgcaacg 

gggaccacag 

tttcaccgtt 

ctcccaaagt 

tattccagct 

gctcaagcca 



gccctctgct 

gcgggatctc 

ctagctcccc 

tcctgagccc 

gcgccagact 

cctactggat 

tcttagcccc 

ctcccagctc 

ctccatgagg 

tagaacaatg 

acagggtctg 

gccgggngtg 

acttgaggcc 

aaatacaaaa 

tgaggcatga 

ataaaataca 

aggctgagat 

cgccactgca 

aaaaaggaga 

gtctgcagca 

ctttctccca 

ggagtctcgc 

tccgcctccc 

cgcccgccac 

ttagccggga 

gctgggatta 

ctgaaattct 

ttcaaccccc 



taagggtgtc 

aaagttatat 

ccggcctcct 

ctctcccagc 

agaacctgag 

agaagtagac 

agcccgtttg 

cctctcaacc 

aatctccacc 

acaaatagaa 

gcctacaagc 

gcggctcatg 

gggagcttga 

gttagctggg 

gaatcgcttg 

aaaaattagt 

ggaaggattg 

ctccagcctg 

ggagagagac 

tgtgcccagg 

tttctctaga 

tctgtcgccc 

gggttcacgc 

tacgcccggc 

tggtctcgat 

caggcgtgat 

ctgagctccc 

taccag 



<210> 129 

<211> 4020 

<212> DNA 

<213> Homo sapier. 



<400> 
gccgagcacc 
gctgccgctg 
gccgcagccc 
caactccagc 
caacatcacc 
tgcggaggcc 
ggacccgcag 
cctggctaag 
cgccaaggtc 
caacatcctg 
caacgctgcg 
agcctacaag 
ccccaccttc 
cctccatgcc 
caggggaccc 
ctacgacatg 
gcagcagggc 
ggagctctcc 
cgggcgggaa 
gatcaagcag 



129 
gcgcaccgcg 
ccgctgctgt 
ggcaactttt 
gccgaacagg 
gcggagaatg 
tggggccaga 
ctgcgcagga 
cggcagcagt 
tgcctcccca 
gcttcctcgc 
ggcatcccgc 
caggacggct 
gaggacgatc 
ttcgtccgcc 
atccctgctc 
gtggtgccrt 
tggaacgcca 
cccatgcctc 
gtggtgtgcc 
tgcacacggg 



tcatgggggc 
tgctgctgcc 
ctgctgacga 
cgctgttcca 
caaggcgcca 
aggccaagga 
ccatcggagc 
acaacgccct 
acaagactgc 
gaagctacgc 
cgaaaccgct 
icacagacac 
cggaacacct 
gcgcactgca 
acctgctggg 
ccccagacaa 
cgcacatgtt 
ccgagttctg 
acgcctcggc 
ccacgatgga 



cgcctcgggc 
gccgcagccc 
ggccggggcg 
gagcgtggcc 
ggaggaagca 
gctgtatgaa 
tgtgcgaacc 
gctaagcaac 
cacctgctgg 
catgctcctg 
gtacgaggat 
gggggcctac 
ctaccaacag 
tcgccgatac 
agacatgtgg 
gcccaacctc 
ccgggtggca 
ggaagggt eg 
ttgggacttc 
ccagctctcc 



cgccgggggc 
gccctggcgt 
cagctcttcg 
gccagctggg 
gccctgctca 
ccgatctggc 
ctgggctctg 
atgagcagga 
tccctggacc 
tttgcctggg 
ttcactgccc 
tggcgctcct 
ctagagcccc 
ggagacagat 
gcccagagct 
gatgtcacca 
gaggagttct 
atgctggaga 
tacaacagga 
acagtgcacc 



eggggctget 

tggaccccgg 

cgcagagcta 

cgcacgacac 

gecaggagtt 

agaacttcac 

ccaacctgcc 

tctactccac 

cagatctcac 

agggctggca 

tcagcaatga 

ggtacaactc 

tctacctgaa 

acatcaacct 

gggaaaacat 

gtactatgee 

tcacctccct 

agccggccga 

aagacttcag 

atgagatggg 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1856 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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ccatatacag 
ccccggcttc 
tctgcacaaa 
cttgctaaaa 
gtggcgctgg 
gtatcttcga 
tgatgctgga 
ttttgtcctg 
actgcaccag 
gcaggctggc 
cctggatgcc 
gaaccagcag 
tgacaactac 
ggaggaatat 
ctacaacacc 
agccaaccac 
gaacaccact 
tgcccaggag 
ggccactgtg 
gatggccaca 
ggcggggaga 
ccggctcaat 
cctggagcaa 
tgcctacgtg 
gcccattcct 
cttggtggtg 
gggctggacg 
gctgcccgtg 
ggaggtggtc 
gcagtgcacc 
ccagtatttc 
cttccatgag 
cagtctcaac 
gaagatggcc 
ctggagggta 
caggctgaag 
aggggccaag 
catccagttc 
caagtgtgac 
gggcttcagt 
cgcctcggcc 
gctgcatggg 
agaagggccc 
gcaggcccgc 
cctgggcctc 
cgggccccag 
ggccctgccc 



tactacctgc 
catgaggcca 
atcggcccgc 
atggcactgg 
ggggtcttta 
accaagtatc 
gctaagtttc 
cagttccagt 
tgtgacatct 
tcctccaggc 
cagccgctgc 
aacggcgagg 
ccggagggca 
gaccggacat 
aacatcacca 
accctgaagt 
atcaagcgga 
ctggaggagt 
tgccacccga 
tcccggaaat 
gccatcctcc 
ggctatgtag 
gacctggagc 
cgccgggccc 
gctcacctgc 
cccttccctt 
cccaggagga 
cctcctgagt 
tgccacgcct 
accgtgaacc 
atgcagtaca 
gccattgggg 
ctgctgagca 
cttgacaaga 
tttgatggaa 
taccagggcc 
ttccacattc 
cagttccacg 
atctaccagt 
aggccgtggc 
atgttgagct 
gagaagctgg 
ctcccagaca 
gtgggccagt 
agccagcggc 
ttcggctccg 
aagggcctcc 



agtacaagga 
ttggggacgt 
tggaccgtgt 
aaaaaattgc 
gtgggcgtac 
aggggatctg 
atgttccaaa 
tccatgaagc 
accggtccac 
cctggcagga 
tcaagtactt 
tcctgggctg 
tagacctggt 
cccaggtggt 
cagagaccag 
acggcaccca 
tcataaagaa 
acaacaagat 
atggcagctg 
atgaagacct 
agttttaccc 
atgcagggga 
ggctcttcca 
tgcaccgtca 

tggggaacat 

cagccccctc 
tgtttaagga 
tctggaacaa 
cggcctggga 
tggaggacct 
aagacttacc 
acgtgctagc 
gtgagggtgg 
tcgcctttat 
gcatcaccaa 
tctgcccccc 
cttctagcgt 
aggcactgtg 
ccaaggaggc 
cggaagccat 
acttcaagcc 
gctggccgca 
gcggccgcgt 
ggctgctgct 
tcttcagcat 
aggtggagct 
caccagagac 



tctgcccgtc 
gctggcgctc 
caccaatgac 
cttcctgccc 
ccccccttcc 
tcctcctgtt 
tgtgacacca 
cctgtgcaag 
caaggcaggg 
ggtgctgaag 
ccagccagtc 
gcccgagtac 
gactgatgag 
gtggaacgag 
caagattctg 
ggccaggaag 
ggttcaggac 
cctgttggat 
cctgcagctc 
gttatgggca 
gaaatacgtg 
ctcgtggagg 
ggagctgcag 
ctacggggcc 
gtgggcgcag 
gatggacacc 
ggctgatgat 
gtcgatgctg 
cttctacaac 
ggtggtggcc 
tgtggccttg 
cctctcagtg 
cagcgacgag 
ccccttcagc 
ggagaactat 
agtgcccagg 
gccttacatc 
ccaggcagct 
cgggcagcgc 
gcagctgatc 
gctgctggac 
gtacaactgg 
cagcttcctg 
cttcctgggc 
ccgccaccgc 
gagacactcc 
tgggatggga 



tccctgcgtc 
tcggtctcca 
acggaaagtg 
tttggctact 
cgctacaact 
acccgaaacg 
tacatcaggt 
gaggcaggct 
gccaagctcc 
gacatggtcg 
acccagtggc 
cagtggcacc 
gctgaggcca 
tatgccgagg 
ctgcagaaga 
tttgatgtga 
ctagaacggg 
atggaaacca 
gagccagatc 
tgggagggct 
gaactcatca 
tctatgtacg 
ccactctacc 
cagcacatca 
acctggtcca 
acagaggcta 
ttcttcacct 
gagaagccaa 
ggcaaggact 
caccacgaaa 
agggagggtg 
tctacgccca 
catgacatca 
tacctcgtcg 
aaccaggagt 
actcaaggtg 
aggtactttg 
ggccacacgg 
ctggcgaccg 
acgggccagc 
tggctccgca 
acgccgaact 
ggcctggacc 
atcgccctgc 
agcctccacc 
tgaggtgacc 
acactggtgg 



ggggggccaa 

ctcctgaaca 
acatcaatta 
tggtggacca 
tcgactggtg 
aaacccactt 
actttgtgag 
atgagggccc 
ggaaggtgct 
gcttagatgc 
tgcaggagca 
cgccgttgcc 
gcaagtttgt 
ccaactggaa 
acatgcaaat 
accagttgca 
cagcgctgcc 
cctacagcgt 
tgacgaatgt 
ggcgagacaa 
accaggctgc 
agacaccatc 
tcaacctgca 
acctggaggg 
acatctatga 
tgctaaagca 
ccctggggct 
ccgacgggcg 
tccggatcaa 
tgggccacat 
ccaaccccgg 
agcacctgca 
actttctgat 
atcagtggcg 
ggtggagcct 
actttgaccc 
tcagcttcat 
gccccctgca 
ccatgaagct 
ccaacatgag 
cggagaacga 
ccgctcgctc 
tggatgcgca 
tggtagccac 
ggcactccca 
cggctgggtc 
gcagccgagg 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 



<210> 130 

<211> 2720 

<212> DNA. 

<213> Homo sapien 



<400> 130 

aagcttgctg gggttttgat agagattttg tttaacctgt agancatttg aagattaatg 



60 
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ccattgtaac 
ggtctacctt 
ttgttaaatt 
tgttttgaga 
tcacagcaac 
agtagctgga 
gcatttttag 
ttgtgattca 
caggtcattt 
ttgaattttt 
aacaaaccag 
aatgtacgaa 
accttgacag 
actttttgag 
gtgagaagcc 
gtatagaaat 
gtttcttagt 
tcatctacaa 
tcttgtccaa 
agacattttg 
gatattagtt 
ctaatttgtt 
ttttttagac 
atacacaata 
ttcggagctg 
cgagtgacag 
cagtgcaatt 
tcaagttatt 
ccctactcac 
gcatcaacct 
cctacatttt 
gatgtcttta 
tcatccttac 
aaatatttat 
aggaacaaat 
atgatccaca 
taattcgtcc 
ccgcaggaaa 
ggggttcggg 
ccttgctgcc 
ggcggaggag 
cccctcacta 
accccaggca 
ggccgcggcg 
ccgggacgcc 



gatattaaat 
atttcaacaa 
tatttcttag 
tggagtctca 
ctctacctcc 
actacaggca 
nagagacagg 
cccacctcgg 
cttgatattt 
aaatttttaa 
ggtgccagaa 
ggtatctgct 
atctggcatc 
gcacggcagc 
aacagtggaa 
acaactgatt 
tctaatagtt 
atagaaattg 
ttgccctcac 
tcttgttcct 
aagttaagtt 
gagtgttagc 
aatcatgtag 
gatgtacact 
cctcctcgcc 
gactttttag 
ggcatatcca 
ttctcctagc 
ctatctaaca 
cacttgacca 
gttgagaatt 
tctggttttg 
tcttctattt 
tgggttttta 
gagtgtccac 
actgtattac 
ttaaaagttt 
tgatactcct 
accaggtgaa 
gtcaatatcc 
gagggaatgc 
taaattcgga 
gcagcgagtg 
gtgatcgatg 
gaggcggcgg 



ctttcaatcc 
ttctttttgt 
ggtttttttt 
ctctgtcacc 
tgggttcaag 
tgcaccacca 
gtttcacgat 
cctcccaaag 
ttactctttt 
cacagttcaa 
ctgcttcagt 
gcatcaaact 
ttttcacctg 
atgtgaggca 
ttgttttctt 
tttgtatatt 
aattaattga 
ttttactttc 
tagaaccttt 
gatcttagac 
tttcataaat 
atgaaaaagt 
gctttgtcca 
ttttaggtac 
gatgattcca 
gtacatgcaa 
tcaccttaaa 
tactctgaaa 
ttaattgatt 
tggtgtatag 
tttatctata 
ttatcagggt 
tttggaagag 
aaatacattt 
ccttgaattt 
caaagttcga 
tccaagttca 
gtacccccag 
cgctgatctg 
cgagagggag 
aaaacagagc 
gctgcctcct 
acaggacgtc 
gggagcggct 



aagaacatgg 
ttgttttcag 
gttttgtttt 
caggctggag 
cgattattct 
cgcctggcct 
gttggccagc 
tgctgagatt 
gatctatagt 
atcagtgtgt 
ttctctgcct 
ttaaacttca 
gtcgtaagca 
gggagaggac 
aattccattt 
gatcttgtat 
ttccttaggg 
tttctaatct 
agtacaaagt 
ataaaaacgt 
aaacttcaca 
gttgaatttt 
ttttttactt 
atgcaataat 
gcgcctgaca 
taatttaatg 
tatttgtctt 
tatacaatag 
tttggtaaac 
tccctttcat 
ctcttaagaa 
gatactggcc 
tttgtgaaga 
ttaaaatgca 
cataaccctc 
gttactcata 
gaaaaaaaaa 
ctcgctctcc 
atagttgaca 
gaggttgggc 
ctcgtccccg 
cgccaatgat 
tggaccggcg 
ggagcggacc 



aatgtcattc 
actacaagtt 
gttttgttgg 
tgcagtggca 
gcctcagcct 
tttttttttt 
ctggtctcga 
acaggagtga 
aagtaaaatt 
cxgatttcat 
tctctttgtc 
cattatcctt 
gaaagtcctc 
acagacccac 
gttgattgtt 
tctaaaaact 
ctttttaata 
ggatgccatt 
taaatagaaa 
tgtcttccgt 
gtttgaggaa 
gtccaagagt 
ctttaaattt 
ttaatgcccc 
gccaggaccc 
cattcatata 
tttcttcatg 
attactgtaa 
taatctaatc 
atgttattgg 
atattgatct 
tcatagcatg 
attgatacta 
acttgggtag 
ggaattaatc 
ggaaagagaa 
atgttgaaga 
ctcacgaccc 
cgggacgact 
cgggagggtc 
gaacccaaga 
tccagcgcct 
cgccgctagc 
cagcgagtga 



catttattta 
ttagatcctt 
ttggttggtt 
caatctcagc 
cctcctcctg 
tttttctttt 
atccctgacc 
gccactacac 
gtttttatct 
ctccttctct 
tatgatgact 
atttctcttg 
gatctcctta 
acagcaagtg 
tattgctagt 
tgctcaactt 
caagatcatg 
tatctttttt 
tgggaagact 
tattatgtgt 
gttcctattc 
ttttaaaaat 
attttatttg 
tcactataaa 
caggcagcag 
aagatcaaat 
ctagaaacat 
actacagtca 
ttgctttctg 
attcaatttg 
gtagtctcgt 
agttgggaga 
tttcttcttt 
catgtccaat 
catgtaatct 
agaagttctc 
acacgaactc 
ctcgctaggc 
gtiggcatcat 
tccggggcgg 
agcagcaacg 
gacagccagg 
agctctgccg 
gggcgcacag 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2720 



<210> 131 
<211> 480 
<212> DNA 
<213> Homo sapien 

<400> 131 

gaattctcag agctggcgaa acagtctggt ccaagcagcc tctcagcagt gcctttcagc 

ctcccctctc tgagtctttc caccccttgc tggtacttta gtttcttcca cttctagcac 

cacgtgtagt ttcccaattt ctcttaccca aatttgctca cagggaaaaa aataaattaa 



60 
120 
180 
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attagccatt 
ggtctcatag 
aaactggaca 
cattttccag 
cagaggaggt 



tacaccacag tgtgaactta ataacaccaa caaaagttcc aaagctctag 

cacctccaga tccatgatct cattcggtgt ttccaacaat gttttgcacc 

catgcttgct acttcatcat cctcatcgtg aacattatta ttattatcat 

atgaagaaaa tgaatcacaa gccaactgac agcccaaagg ctccacagct 

aaatcatgtg cttaattcag aacttttggc tcccatcact atgctcttcc 



240 
300 
360 
420 
480 



<210> 132 

<211> 2268 

<212> DNA 

<213> Homo sapien 



<400> 
accccaggca 
ggccgcggcg 
ccgggacgcc 
gtgacgcagc 
ttttatctga 
agtgtttgca 
tggtattaaa 
catgattcct 
ggtgatagtc 
tttagcactg 
tatggaatac 
tttcaacctg 
tattgttcac 
catcatcatt 
atttttcatt 
aacccttccg 
gatcattctc 
gaagaacaaa 
tttcttttcc 
catcatacgt 
tatagcttat 
taaaagatat 
cctttcaaca 
caagaagcct 
attttgtgaa 
ttagctactt 
aagctctgaa 
cattagacag 
tgagaaattt 
atttccacat 
gttccagatt 
tattagcaac 
gcagtagtcg 
actgccaaaa 
ataaaagtta 
tatattagtt 
tcataaagta 
tatccatatc 



132 
gcagcgagcg 
gtgatcgatg 
gaggcggcgg 
gcccggggcg 
ataactcact 
acaaattcga 
agaatccaag 
actttataca 
atttactttt 
gctgacttat 
cgctggccct 
tacgctagtg 
ccaatgaagt 
tggctgctgg 
gagaacacca 
atagggctgg 
acaagttata 
ccaagaaatg 
tggattcccc 
gactgtagaa 
tttaacaatt 
tttctccagc 
aaaatgagca 
gcaccatgtt 
agaaggagca 
ttcagaattg 
caaaagcttt 
atgacggctg 
tactgacaga 
aaaggtattt 
gttctgtcca 
tgtgctacac 
ccaagttgca 
caatgcccgc 
aactacttgt 
tgatttaata 
tgccttcctg 
tctaaaccgc 



acaggacgtc 

gggagcggct 

gcgggagacc 

cgggtttgat 

gatgccatcc 

cccaggtgat 

atgattgtcc 

gtatcatctt 

atatgaagct 

gctttttact 

ttggcaatta 

tgtttctact 

cccgccttcg 

caggcttggc 

atattacagt 

gcctgaccaa 

ctcttatttg 

atgatatttt 

accaaatatt 

ttgcagatat 

gcctgaatcc 

ttctaaaata 

cgctttccta 

ttgaggttga 

agagaacatt 

aaggagaaaa 

tctttccttt 

ctcgaagaac 

aatgcaatct 

agaatatatt 

gcttccaaag 

tcgcacctgg 

gatctttttg 

aagatggctt 

aaaggtgctg 

tctgagaagc 

tttaaaaaaa 

tgttaattga 



tggaccggcg 

cgagcggacc 

cgcaccagcg 

atttgacaaa 

cagaaagtcg 

caaaatgatt 

caaagctgga 

tgtggtggga 

gaagactgtg 

gactttgcca 

cctatgtaag 

cacgtgtctc 

acgcacaatg 

cagtttgcca 

ttgtgctttc 

aaatatactg 

gaaggcccta 

taagataatt 

cacttttctg 

cgtggacacg 

ccttttttat 

tattccccca 

ccgcccctca 

gtgacatgtt 

cctctgcagc 

tgcattatgt 

tgcaacaaga 

aatgtcagaa 

ccctagcctg 

aaatcgttag 

ggcagtaaag 

tactgcacat 

tgaaattcaa 

atttgtataa 

cactggtccc 

gtatatagtc 

gtatatattc 

ntaaaatctg 



cgccgctagc 

cagcgagtga 

cagccggccc 

ttgatctaaa 

gcaccaggtg 

cccaactctt 

aggcataatt 

atatttggaa 

gccagtgtt^t 

ctatgggctg 

attgcttcag 

agcattgatc 

cttgtagcca 

gctataatcc 

cattatgagt 

ggtttcctgt 

aagaaggctt 

atggcaattg 

gatgtattga 

gccatgccta 

ggctttctgg 

aaagccaaat 

gataatgtaa 

cgaaacctgt 

acttcactac 

ggactgaacc 

caaagcaaag 

actcgatgaa 

cttttgtcct 

aggagcaaca 

ttttcgtgcc 

tttgtacaaa 

cctgtgtctt 

tggtgttact 

aagtagtagt 

cgcggtaaaa 

cacacatata 

gcaaagtt 



agctctgccg 

gggcgcacag 

tcggcgggac 

atggctgggt 

tatttgatat 

ctactgaaga 

acatatttgt 

acagcttggt 

ttcttttgaa 

tctacacagc 

ccagcgtcag 

gatacctggc 

aagtcacctg 

atcgaaatgt 

cccaaaattc 

ttccttttct 

atgaaattca 

tgcttttctt 

ttcaactagg 

tcaccatttg 

ggaaaaaatt 

cccactcaaa 

gctcatccac 

ccataaagta 

caaatgagca 

gacttttcta 

ccacattttg 

tgtgttgatt 

gttatttttt 

ggagatgaga 

ggttttcagc 

gatatgctaa 

ataggtttac 

aaagtcacat 

gtcctcctag 

agattatata 

tatatatgta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2268 



<210> 133 
<211> 826 
<212> DNA 
<213> Homo sapien 
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<400> 
gatctaccca 
ctctactctt 
tcacccatgt 
gccaaagtta 
gctggaaacc 
ctcatcagcc 
gtcactggac 
tccggaaacc 
actgccccgc 
atgcatggag 
gagagagccc 
gctgcrgctg 
acggtaattg 
aacaccctgg 



133 
ccttggcctc 
ataattaaac 
tctagatggg 
atatagagaa 
taggttttac 
tctgcctgct 
acaagattgc 
tgggtaccct 
catccacccc 
tgtataaaag 
cagaccgagg 
ctctggggct 
gtaactcagg 
cttttccaca 



ccaaagtgct 
cagctgttgc 
aaaactgggc 
tggagcttcc 
gcttgtccca 
ccaggggtca 
tttcccacag 
tcacccacct 
agggtaataa 
gggaagggct 
gaagcatgga 
cctgtacctt 
cagagaaggg 
tgcggtgtca 



gggacaggtg 
ttttcctgcc 
tgtagctggg 
agggtatagg 
gt tttgatgt 
cagggccaag 
ctgtccttcc 
agctctgtcc 
atcagggcag 
aagggagcca 
tggatggaga 
tggtctcccg 
gtgggcaggg 
ttcagtcctt 



tgagccacca 
aagaaaccag 
agaggccagt 
ggttgggtct 
tagccctgac 
ccagatagag 
tccagcccct 
cgcagtgaga 
agcagaattg 
cagaacctca 
aggatgcctc 
acagacacca 
gtgtaggttc 
acgatc 



tgcctggccc 
tcatgaagat 
cagggacaaa 
gggctaggga 
agtgctgttt 
ggctgctagc 
ctgctcccca 
tttattgctg 
caatcacccc 
gcggatctca 
gctggggact 
ccacctctaa 
ccaccttccc 



<210> 134 

<211> 6910 

<212> DNA 

<213> Homo sapien 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
826 



<400 
ggatcctgca 
accaggagga 
tcacatggaa 
gtggacattt 
taaagtctat 
ccattttact 
catggacccc 
cagcaatggg 
gccttgacct 
tatgtttcca 
gagcattgga 
cctgcaaagg 
gtttgaagcc 
ggagcagggt 
cattctcagg 
gcccaggacc 
cctttggagg 
gatagtgcag 
tgaaaggttc 
gggagggtgg 
accccgtcca 
ggtgtgtgtg 
ctgcaggatg 
cgtgttcttg 
gggtgcaggg 

ggtgggcagt 

cactgtgggt 
aagacgcaga 
gctgctttct 
agaaggggac 
gccgcagcct 
gacagtccgg 
aagcaaggca 



> 134 

aggagggata 

gaccccatgt 

ccagtgcgct 

tctgcagttt 

taaaagaatc 

cctctcagcc 

cagtccagac 

gaattaggca 

Lcgccctgag 

gagcaggttc 

atggcactca 

gcacgggcac 

atgccccagc 

tatgagcacc 

taaagccctc 

ctgccaggca 

atggggaagg 

agaaaacccc 

cagggagcca 

acaggagaca 

gcagggcctc 

atgctgccgg 

atcctggagc 

ttgtaagcgg 

gggaggcgag 

gagggtgagc 

tggggaagca 

gcaatgagcc 

cttgcggtct 

ttccactggc 

ccctcagggc 

ggcaggaggc 

caaggcgggg 



caaattacat 

gactccagga 

cctgtggtgg 

ttgatcaatt 

caaggctccc 

ctggagaaag 

cccacgcctt 

cctgacttct 

agtctcaggc 

ctgggtgaga 

gggcaaaggc 

tgggcactag 

atccaggcaa 

tgcacctgga 

cctggccctc 

ctcagcactg 

agcgcagcac 

agctcactgc 

gcaggagggc 

ctttggattg 

ctgcttggcc 

aggatgtgga 

cctgggtggc 

cgagttggga 

agaggtagga 

acccggccca 

gggcggtggt 

cttctgtgta 

ggggactgtc 

ccaggtcaca 

gctcagggtc 

aggcggggac 

cggntgatgg 



acatttgtca 

ccctggttga 

agggtgtacc 

ttgcaatgaa 

tctcatctca 

gagaggccag 

ttctcagcat 

ccttcatcta 

aggtccagag 

taaaaggatt 

agaggtgcgc 

agccgctcgg 

caggtggctg 

gatgcaccag 

gctgggaaca 

ccattcccag 

atgctggtct 

agagagggca 

tt.tagctgtg 

ggactgcagg 

ccacaggtac 

gaagctgcaa 

ctacagacaa 

gctgagagct 

cccaaaagca 

gaggacggcc 

ggagaaatgg 

gtgagaaccc 

cttcccatag 

ggctgagtgc 

cctgcagtcc 

gcaggtggct 

cactggggag 



aaacccacag 

caacaacgta 

tgtgtcaggg 

ctaaatctgt 

cgataagata 

gtcccaccac 

cctcagacca 

cctttggctg 

ccagttctcc 

cgggccgaac 

gtggcagcgc 

gcccctagga 

aggctgctgc 

accttccagg 

cccagatccc 

caggtcccgg 

gtggtgctgc 

ggactcagaa 

aagccgctaa 

gcggggccac 

aacttgggag 

caggtggaca 

catcgtgggc 

gggagcaggg 

catctgccct 

atcctgtggg 

gcacgggcac 

gctctgcacc 

gccagaaaac 

ngagcctggt 

cggcaaacct 

ggtggttccg 

gatgtttcct 



catgttgacc 

tcgagattcc 

cagggggtac 

ggtataaaaa 

aagtccccat 

cctccaccag 

gcaggacttg 

ggggccccca 

catgacgtga 

agggtggagg 

cctggctgtc 

cggtgctgcc 

agatctggag 

agctggggcc 

tgcccctgct 

cactctgcat 

cagggcaggg 

gcactaaagt 

tccaggagca 

gagggacatg 

gaccacgcat 

gcctgcatcc 

acaaatgtgg 

tgggcagcct 

gggcccctgt 

gtcgcgtctg 

ctctgcagag 

aacctcggcg 

tgaggccctg 

gttcgccggg 

tcctgatggg 

ttgttctcag 

ggcccgtgga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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gagggtggcg 
gggatgttgt 
ccactccatg 
cgtgggtcgg 
ccaggaaggc 
ctgtgtgtgt 
tgtgtagtgt 
gtgtgtgtgc 
tctcttcaca 
ctgtcatcac 
gaatgtggcc 
cacgcactct 
aagatataga 
cggggaaggg 
gaggtgggca 
ttactacagg 
gaatgacctc 
agacctgagt 
acacaaacag 
ggggacaagg 
ctctgccctg 
ccagatgtgc 
agggacttct 
ctggacgtcc 
tccagcccca 
ggaggcagcc 
gcaacttagc 
gcctgaactt 
tgcccaggag 
gggactgcat 
accatggggg 
gggtcgggga 
gctgatccgg 
ggaggaggac 
tggacaggtg 
cactacacag 
aaggccaact 
cccacccaga 
gcacccacgt 
ccctcctttg 
gagactgagt 
tgaggccctg 
aggagggttg 
ccttgggcca 
gctccagcca 
gggagccggg 
tggcngtgag 
gtcacctttc 
• cggggtccca 
ttgtcctggg 
catggcttct 
ccgacgtgca 
atccccagaa 
ggtgggtgct 
tgggtggtgg 



cctggccagg 
cattaggacg 
gtaggaaggg 
ggccttcact 
aggtccagga 
gtcttgcatc 
gtgcacgtgt 
atgtgcaggt 
catggtgttg 
tgtcaacagc 
gacaggggaa 
ggctgtgggt 
tgcttcacag 
tggggaggag 
aagcttgaca 
gagggcccgc 
agttaccctg 
ggcctttgcc 
gcagcttcta 
aggatgggat 
gcctctgtag 
tgtcgcccaa 
cccaggccct 
agcccagcat 
gagaccctgg 
aggaggcctg 
tctttttgga 
cctccatgcc 
cctgtctcgc 
cttccagtac 
cccaatttcl: 
ttcctcacct 
agaat ttggg 
actgaaggat 
acaactgtat 
gcaccgtggc 
ctatggaact 
gagggtgatg 
tccccatccc 
ttaagggatg 
ttgcagttac 
gacacacagc 
agaggcagct 
gaaatgtaga 
gtgcagaggg 
gctccagaac 
aacccgccct 
cctgaagaag 
gggctctgtt 
aggggtgt.cc 
gtagacagcg 
gcagatcctg 
ggcaaccacc 
ggctgaggcc 
cagacagaat 



tgggcaggga 
ggggaaggac 
aacgctattg 
gaggctggcc 
ggctgggtgc 
tgtgcacatg 
cgtgtgtgaa 
gccggcatgg 
aggtcttgca 
ccacagcagc 
attcctcccc 
aggatggcat 
agacagcaga 
agggcagcca 
acaggggtca 
atgggtgagg 
tcagcacctg 
cagcacctgg 
ccagggcccc 
acgggtcagg 
gaatgggcct 
ggccgtgcag 
gaagaagaag 
cttccactac 
agtggccagg 
gggctgcctt 
gagcggctgg 
ctggaggtca 
tggatcagcc 
ggtgaggcca 
ccttctccac 
tcctgccggg 
atgagagcag 
gcttcccagc 
ccagaaaatc 
ggagctcctg 
cactgcaggg 
ccaagcctgc 
caggctacag 
agatttgcag 
ctgtgtgggg 
agctctgtcc 
ggaggactcc 
tgcatgggag 
gcctgtgggt 
tccctggatg 
gaagaggctg 
ccagggaatc 
tttgctcagg 
cgggggctga 
tttcccttgc 
cgccaggaga 
gagctgccct 
tccctgtggc 
ctggggctga 



gaggctgatg 

tggatgagga 

ggaatagtgg 

tatagatgac 

gcataatgga 

ctgtgtgttt 

tgtatgtgtg 

gtgtagtgtt 

tgggcgcacg 

cagctggaca 

accattccct 

gaagcacaaa 

gcagacgccc 

ggggctctcc 

cctcctttct 

tggtgccaga 

tgggcagaag 

aggccgctct 

cagtcagctc 

gcctgtgtct 

gaatggcgct 

aggttcctcc 

gtgctgcaga 

accatagaag 

gatggggatg 

gtgctcagca 

gcctggttgg 

tgttcaaatc 

ccaaggtgtg 

gggacccggg 

cacccagtgg 

gagattggtg 

ggagatttgg 

accaagatct 

taccaggaac 

ttgaaggcgg 

agcgtggaca 

ctcccaggca 

gccccacatt 

gggaggggaa 

atccatgctc 

ccactggaaa 

actgggcacc 

ggcagggttg 

gcacagtggg 

atgctgaggt 

caggggacct 

tggcccaagt 

gcatggatgt 

gtcctcctgt 

tgatgacgct 

gcctggccgc 

tgctgcgggc 

cctggccccc 

taaacagcgt 



cttggagtcg 

tgtcacagtg 

ggtttaggta 

atctgggaga 

aggaagggga 

ctctgtacct 

gtgtgtgtgc 

tgtgcacaca 

tgtgcatgtg 

taaataaagg 

gggggcatcc 

gcttggtttc 

cagaggcact 

cctcaggaca 

tggagaaaag 

cttgggtcgc 

ctaccatctc 

gagaaaaggc 

cctgcaggcc 

tgctggggcg 

tcaaccgatt 

cgatggtgga 

acgcccgggg 

gtgtgggcca 

gaggactgaa 

gtgcatcctc 

ccacagcccc 

caccgtccag 

gaaggagcac 

cagtgctatg 

ggaatggagg 

cgaggctggg 

gtgtcggggc 

gagggctgtc 

tggccttcaa 

aactgtcact 

cggtcaggcc 

ctgcctgcca 

tctgttgccc 

atgtgagctc 

caggctggaa 

gctctgggtg 

cttcccagtg 

tggggaagac 

gagaactcaa 

gtggccccct 

gggccttggt 

ggtcatcaag 

ctccacccct 

gcaaggtctg 

ctttgagctg 

cgcagccagc 

ggccctcaag 

tgctggagag 

cacccagcag 



gtcacctgca 
gtgacagccc 
aaagggcacc 
gagtcaggac 
gcgctcctgt 
gcattgcaca 
acaagtgtct 
tgcacatgcg 
catcttctgc 
agttttgcag 
atggagcccc 
tgtcctgcag 
gtgcccaggg 
ctgtgtgggt 
ccctaccctg 
caggtcccgg 
atccctgctt 
tgcagctcga 
gattcccctt 
gcctcacaag 
gcggctgaac 
tgcagtggcc 
gagcctgacc 
tgcgggaagg 
gggagtgtgg 
cccgcagcca 
agttctgcca 
ctcatgttca 
tttgaggcct 
gggaagggac 
ccacagggag 
gctgggctzgg 
agtctgggca 
ccctgctccc 
ccgccctcaa 
agaagccatc 
agcaaccagc 
atgccacacg 
tcagccttcc 
cccctcacat 
gaaagtcgga 
tacaaggaga 
tgcccggtca 
agcagcacag 
tggaagcaga 
gccctaatgg 
ggagatgggg 
gtttcagatc 
cagagggagg 
accctgcaga 
gctcggaacc 
atcagcgaac 
gagaccttga 
cagcccccac 
cccattcccc 



2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 
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tgcacctgct cttcctcccc ctcaaggaca gggagctctt cttcctctgg aatccctctt 5340 

caacgccctg gggattaacg tggggcatgt ccttctgcgc tcggggctgc ttaagttagg 5400 

gga^gtttgg ctgggctcag caggtgcaag gaagcacttc ctacgacctg ggcttcccat 546 0 

ggatctggga cctctgcggg gtcttcggta ggaagggtgc tgagagcaca gggagcccca 552 0 

tccagctgag gaccctttct gtggatgccc ccacctccag gctctaccct gtgggtctgt 5580 

ttttggagcg agtggtgagc tcagacttgg tgcttcagaa ctaccacatc ccagctgggg 564 0 

tgagtgagcc ccacacccct cgagctgaga acctccctcc ccagtcattc cctgatccct 5700 

gctctgcacc gtccgcagac attggtacag gttttcctct actcgctggg tcgcaatgcc 5760 

gccttgttcc cgaggcctga gcggtataat ccccagcgct ggctagacat caggggctcc 5820 

ggcaggaact tccaccacgt gccctttggc tctiggcatgc gccagtgcct cgggcggcgc 5880 

ctggcagagg cagagatgct gctgctgctg caccacgtaa gcaggcctgg gggcgggggc 5 94 0 

gggacctggg cagcagaggc gggacctgca cactgggggc ggggcttgca tggtgtgatt 6 000 

gacacctggg aacagtggat ggggccttgg tcggttgagg tcggcgtgac cagggaggat 6 06 0 

ctgtgctgag caagacaggg taggatctgg gtgaggttgc ttctaaacat tgaaatgggg 6120 

actaggggag tggggtggag cctgtacaga ataatggggc ttgggcaaga cctgggcagg 6180 

attcagtctg ggcctggtcc gcaaggtggg gccggtcaga aatgggatag gttggggccc 6240 

aggctgctgc tcccccttca gcataattgt tgcacctggg acgatgggag gaagctgccc 6300 

caggtccatg ggctactgac caggccagat ggaaacccag cctctgtcct aggtgctgaa 6360 

gcacttcctg gtggagacac taactcaaga ggacataaag atggtctaca gcttcatatt 6420 

gaggcctggc acgtcccccc tcctcacttt cagagcgatt aactagtctt gcatctgcac 6480 

ccagggtccc agcctggcca ccagcttccc tctgcctgac cccaggccac ctgtcttctc 6 540 

tcccacgtgc acagcttcct gagtcacccc tctgtccagc cagctcctgc acaaatggaa 6600 

ctccccaggg cctccaggac tggggcttgc caggcttgtc aaatagcaag gccagggcac 6660 

agctggagac gatcttgctg gcagggcctg gccttgtccc cagccccacc tggccccttc 6720 

tccagcaagc agtgccctct ggacagcttg actctactcc tcccagcgct ggctccaggc 6780 

tcctcatgag gccatgcaag ggtgctgtga ttttgtccct tgccttcctg cctagtctca 6840 

catgtccctg tccctctcgc cctggccagg gcctctgtgc agacagtgtc agagtcatta 6 900 

agcgggatcc 6 910 



<210> 135 

<211> 2476 

<212> DNA 

<213> Homo sapien 



<400> 135 



gaattcttgt 
ttttttgctt 
tgtgttctaa 
catttcaata 
cttcacttcg 
aaaccatcag 
ggatttctgg 
gtttctagca 
cctctatggg 
aaagtttzttg 
atgagtgttg 
tggcaagcat 
acattttatt 
gctttcccac 
cttggtttta 
ttactgaaga 
gcagctgctg 
ctggatgctc 



tttacaagcc 
ttgacaaaca 
gagttctatg 
tgaagggcaa 
ggcttgtgaa 
ataagcattt 
tcaatattgt 
tatacatctt 
caacctatta 
gttcttttct 
ataggtacca 
cttatatagt 
ttcgagacgt 
ctgagaaata 
ttatcccttt 
cgaatagcta 
ttgttctggc 
tggcctggat 



atggctctgt 
ttcaaaatgc 
ttttttctcc 
ctccaccctt 
catctctggc 
agatgcaatt 
cgtggttaca 
caacctcgct 
ttcttataga 
taccctgaac 
atctgtcatc 
tccccttgtt 
cagaaccatt 
tgcccaatgg 
aatattcata 
tgggaagaac 
cttcatcatt 
gggtgtcatzt 



ttcttaatgt 
taatgattca 
acagaaggca 
gccactacta 
aacaatgagt 
cctattcttt 
ctgttttgtt 
gtggctgatt 
ratgactggc 
atgtttgcaa 
cacccctttc 
tggtgtatgg 
gaatacttag 
ccagctggga 
gcaacatgct 
aggataaccc 
cgctggcttc 
aatagctgcg 



tttctataat cactcacttt 



aggatgtcct 
taagaactag 
gcaaaaacat 
ctaccttgaa 
actacattat 
gtcaaaaggg 
tactcctttt 
tctttggacc 
gcattttttt 
tgtctcaaag 
cctgtttgtc 
gagtgaatgc 
ttgccttaat 
attttggaat 
gcgaccaagt 
ccttccatgt 
aagttatagc 



cagctctgta 
gagctgctga 
taccagcggt 
ctgttcacag 
atttgtaatt 
tcctaaaaag 
ggctactctt 
tgtgatgtgc 
tatcacctgc 
aagaaatccc 
ctcattgcca 
ttgcattatg 
gaaaaatatc 
tagaaaacac 
cctgaagatg 
tctgaccttc 
agtcattgac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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ctggcacttc 
tattgttttg 
acttggctcc 
atggagacct 
ctttgaggct 
atcttttctg 
gaattgccca 
caacctagaa 
gatttgggga 
ccatttttat 
acaagacaac 
tactggatta 
cttgcatttc 
aggaatatag 
agatgtacta 
aagcactttt 
ctgatacttt 
tcacatcctt 
ttattaatgc 
tttaacttgt 
acatggggtc 
gaacctgtca 
ctatatctat 
caattgcagg 



cttttgccat 
ttggaaaccg 
aagggaaaag 
ttgtgtctta 
caccagaatt 
aatcttctga 
ttgtttttct 
gtaactggtg 
ttcagatttc 
caggatttcc 
attgtaagag 
ttcaggcctt 
acttgagtgg 
atcaaatcat 
taacatagaa 
taaaaaactt 
tttgaaatca 
gactttttag 
tttggttctg 
aataaaccct 
tgggtggggg 

gggcagtaca 
aaacactgtt 
tctaga 



cctcctggga 
gttccaacag 
agagagtatg 
aacgtgagag 
atttttaagt 
aaccaaatgt 
gatatgtttg 
a t a tat c tea 
tctttgaaac 
tcttgaacca 
agatgagcac 
aggcatatgc 
aggtttatag 
actcctatgc 
ttgeaatcta 
tctactcatt 
gtaaacactg 
atgtgctgct 
ggttgtttcc 
taactggcat 
caaagagacc 
atgtgacttt 
tgttccagaa 



ttcaccaaca 
aagctccgca 
ccttgccgga 
caaaatgeat 
ggttttaata 
aactatgttt 
tacaagattt 
aattgtaatt 
atgcttgtgt 
gaaccagtct 
ttctaagttg 
ttctttaaaa 
ttaatctata 
tttagcttat 
taatatttgt 
ttaatgattg 
tgtattgttg 
ttgatatata 
taaaatatct 
aggaaatggt 
cagtcaatta 
gaaaatatat 
tctgtatgat 



gctgcgttaa 
gtgtgtttag 
aaagcagttc 
gtaatcaaca 
aaataataaa 
tategtccag 
tcattggtga 
aataatagat 
ttcttagtgg 
ttcaactcat 
agtatattat 
aegctataaa 
actacatatt 
ttttacagtt 
gtgttcacta 
tttaaaggtt 
taaaatgtaa 
ggacattgat 

gggtggctta 

atccagaatg 
catgtttggt 
accgtggggg 
tctatggagc 



tccgtttctg 
ggttccaatt 
tcttagagaa 
tggctacttg 
acttccccta 
ngactttcag 
gacatattta 
tgtgaataat 
ggttttatac 
tgcatcattt 
aatagattag 
- tatatcccc 
gaatagggct 
atagaaagca 
aactctgaat 
tctattttct 
aggtcacttt 
ttgattttta 
aaaaaaactc 
gaattttget 
accaagaaag 
tagttttacc 
tattttaaac 



<210> 136 

<211> 3100 

<212> DNA 

<213> Homo sapien 



<400> 
gaattcctga 
tataatttaa 
gaaaaaaagt 
tgtgtacggc 
gecttatega 
ctctattccg 
ccctatgcct 
aacttcttcc 
ggaggccggg 
tcacctgagc 
aaaatacaaa 
tgaggcagga 
actgaactcc 
cgccaggagt 
ctttctaacg 
ctgataaatc 
atgatgctga 
ggttaaaact 
acagctctgg 
accatcaacg 
gtgagcatta 
tcaaggactc 
gctgaggaca 
attcatggct 



136 
gatccctgta 
tgtattatac 
ttcaggaaag 
aaagttactg 
cacagaaaag 
gtaacaccac 
caccctgcag 
aatcccttgg 
gcagtggctc 
tcaggagttc 
attagcegtg 
gaatcacttg 
agectgggea 
ctcaggcttt 
ctcttggttt 
atgggcaggg 
ggtttgggcc 
ggttgatttt 
aagteggtet 
aacatgatgt 
tataacattt 
aattagagat 
ctacaggagc 
ttcttgtccc 



ctggtactcc 
agaaattegg 
ttttccaaca 
gcaatctggc 
caatgccttc 
ttgacagaaa 
ctcagcactc 
ctattttgag 
atgectctat 
gagacgagee 
tgtggcggca 
aaccegggag 
acaaaagtga 
cactgacttg 
cctggccact 
ggtgaataaa 
taaagcaaat 
atataaactt 
ctccttgcaa 
tcgttacggg 
tcatgcaccc 
tttcttaatt 
caaattagag 
ctcaccccca 



igctagaata 
agaaacttga 
ttccaatgtc 
ttctgccctg 
cacccttcgg 
gatggcaagg 
agaactcctt 
actgcccctt 
ccagcacttt 
taccaacatg 
cacgcctgta 
gecagaggtt 
aactctgtct 
ganaacttgg 
cctcccaacc 
caaatcactc 
tcatatgttt 
gaaaaacatt 
ggtaaggata 
cgctgttctt 
ttacataaac 
tttttatttt 
aagggggctt 
gtataatcct 



accttctggt 
cgtatctctg 
acgtgtgttt 
aacctctacg 
gggcatttaa 
aaaagectet 
tcctgggcag 
gcgggtccag 
gggaggccaa 
gtaaaacgee 
atcccagcta 
gcagtgggcc 
caaaaaaata 
ggtgtagtct 
tccccaagtt 
cccagtttta 
gcatttatgt 
actttttget 
tgagcttggt 
attccatget 
aattagaagc 
tttttttatt 
gcagtcctgc 
ttttgecagg 



ttcgagggtt 
aatgagcaga 
atatatatac 
ccaagttctg 
ggttgacaca 
tccccacatt 
egtcaattte 
aaggaageca 

ggtgggtgga 
atctccacta 
cttgacaggc 
aagatcaege 
aaaagaaaga 
ccttcctccc 
ggaaaatcag 
catactacat 
aagagaaagt 
tttgacctgg 
catattgatg 
agaaggacca 
tgtttactcc 
teaggectga 
catattgeat 
ttaacaactc 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2476 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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actaatttca 

gttccaactt 

ctcacccctc 

aaccaggcac 

gtaccttttt 

aaaatacagg 

cggtggtacg 

catgagcata 

acgattccaa 

catctactgc 

gacagatgcg 

accacctcca 

gaagggtttt 

actcctccaa 

gtcccctccc 

tcgaggggat 

gcgcaggagc 

cgcaggagtg 

gcgcggaagg 

tccgaaggaa 

ggagccgctc 

agacgggggc 

tggcttgggg 

gcgctcagaa 

gcagcaggca 

gggaaaacgg 

cgacctccct 

gccctgccac 



aagcaattca 
taaattgct t 
caaaactaat 
gaagaccaag 
gtttgtattg 
tgttatcatt 
tttactctgt 
catttttgca 
agggccctca 
actcgaggtt 
cgcttggctc 
atcccccaag 
acacacaccc 
gcctggcgga 
ccgtcaaggg 
gcggggggag 
gggtgcggga 
ggtgcgggag 
gggcgcgggg 
cccgcgggcg 
aattgccaac 
gtgcctttgc 
agtgggagcg 
acatgctgaa 
ggcaggcagg 
gcccggcgcc 
ctgcgcacca 
accccccgcc 



tttgccaact 
ttggatttta 
ctcagagttt 
gatctgaaat 
acccagctca 
gaaaagtcta 
atctcaacgt 
tcatgctcac 
gcgttagctg 
caacagagga 
cagcaaccgc 
ccgtcacgac 
cgctaggctg 
caagctgtcc 
agctagcgct 
cgggggcgca 
ggcaaacggg 
cgagtggggg 
gaacagggac 
cacgggaggc 
gggagggggg 
ccgcattggc 
gggtggggtg 
gticccggcgg 
caggcaggca 
catgctccgg 
cgccgcccgg 
cccggcctcc 



caaaaaaaaa 
atcctctggg 
tgggaatctg 
cgcagttcat 
aattataaaa 
aaatgtaatt 
ttctgtctcc 
acgttcatta 
ttagatgcac 
tcttgaagaa 
tctgctggga 
gcatgctggc 
ccccacatca 
caggcgctct 
ctgttccgca 
ggagggagcc 
gcgcaggagt 
ctcaggaggg 
caggaaccag 
actagctacg 
tgcggggagt 
tgcaggagcc 
ggtgctgggt 
ctcttccagc 
ggcagcggca 
ccccgcgggg 
gcttctgggg 
gcagctcggc 



aaaaaaaaaa 
gtatttatat 
ggaacttggg 
tacgtcagca 
tcacatgagt 
aaacgcgttt 
ctaagcctct 
gctaggactg 
aaaccttcgc 
cagcgccaga 
agtttcttct 
tgggtcccgt 
caaccaagct 
ggcgcttcct 
gagaaccccg 
gagtgctgga 
gggtgcggga 
gtcgggcccg 
cgggcgcagg 
cgatcagctc 
tggaggtggg 
tgacgcgagg 
gccggagctg 
agcaggcagc 
gcgacagcgc 
cggctgccct 
tgttccccaa 



aaaacccaca 
gtttccttgg 
caaaggggga 
taaactgaca 
aataaaacac 
ttccccctcg 
cctttcatat 
aggagtgtgt 
ttcctttcca 
gagcattctt 
aaccactaac 
cttgacgggg 
cgcagggcaa 
gaacaccaag 
gaactgcagg 
ggcaaacggg 
ggcaaacggg 
ggggagccag 
aaggggtgcg 
gggactctca 
ggagcagacc 
ccccgggggt 
cgggcccggc 
ggctccagca 
tcggctctgc 
gacccggccg 
ccacggccca 



<210> 137 

<211> 1723 

<212> DNA 

<213> Homo sapien 



<400> 
tgctacccgc 
cccgcccccg 
agcccggcaa 
tgctggtgcc 
cgctgtctca 
tcgtggcggg 
tcaccaacct 
tgccgttcgg 
agctgtggac 
ttgccctgga 
gcgcgcgggc 
tgcccatcct 
accccaagtg 
ccttctacgt 
agaagcaggt 
cgccctcgcc 
ccgccgccgc 
cgcgcctcgt 
tcttcacgct 
agctggtgcc 



137 
gcccgggctt 
gcctccgcag 
cctgtcgtcg 
cgcgtcgccg 
gcagtggaca 
caatgtgctg 
cttcatcatg 
ggccaccatc 
ctcagtggac 
ccgctacctc 
gcggggcctc 
catgcactgg 
ctgcgacttc 
gcccctgtgc 
gaagaagatc 
ctcgccctcg 
cgccgccacc 
ggccctacgc 
ctgctggctg 
cgaccgcctc 



ccggggtgtt 
ctcggcatgg 
gccgcaccgc 
cccgcctcgt 
gcgggcatgg 
gtgatcgtgg 
tccctggcca 
gtggtgtggg 
gtgctgtgcg 
gccatcacct 
gcgtgcaccg 
tzggcgggcgg 
gccaccaacc 
accatggcct 
gacagctgcg 
cccgtccccg 
gccccgctgg 
gagcagaagg 
cccttcttcc 
ttcgtcttct 



ccccaaccac 
gcgcgggggt 
tccccgacgg 
tgctgcctcc 
gtctgctgat 
ccatcgccaa 
gcgccgacct 
gccgctggga 
tgacggccag 
cgcccttccg 
tgtgggccat 
agagcgacga 
gggcctacgc 
tcgtgtacct 
agcgccgttt 
cgcccgcgcc 
ccaacgggcg 
cgctcaagac 
tggccaacgt 
tcaactggct 



ggcccagccc 
gctcgtcctg 
cgcggccacc 
cgccagcgaa 
ggcgctcatc 
gacgccgcgg 
ggtcatgggg 
gtacggctcc 
catcgagacc 
ctaccagagc 
ctcggccctg 
ggcgcgccgc 
catcgcctcg 
gcgggtgttc 
cctcggcggc 
gccgcccgga 
tgcgggtaag 
gctgggcatc 
ggtgaaggcc 
gggctacgcc 



tgccacaccc 
ggcgcctccg 
gcggcgcggc 
agccccgagc 
gtgctgctca 
ctgcagacgc 
ctgctggtgg 
ttcttctgcg 
ctgtgtgtca 
ctgctgacgc 
gtgtccttcc 
tgctacaacg 
tccgtagtct 
cgcgaggccc 
ccagcgcggc 
cccccgcgcc 
cggcggccct 
atcatgggcg 
ttccaccgcg 
aactcggcct 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3100 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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tcaaccccat 
gctgcgcgcg 
cgggctgtct 
acgacgatgt 
acggcggggc 
cctcggaatc 
gggaacgagg 
cctcgtctga 
tttgggaagg 



catctactgc 
cagggctgcc 
ggcccggccc 
cgtcggggcc 
ggcggcggac 
caaggtgtag 
agatctgtgt 
atcatccgag 
gatgggagag 



cgcagccccg 
cgccggcgcc 
ggacccccgc 
acgccgcccg 
agcgactcga 
ggcccggcgc 
ttacttaaga 
gcaaagagaa 
tggcttgctg 



acttccgcaa 
acgcgaccca 
catcgcccgg 
cgcgcctgct 
gcctggacga 

ggggcgcgga 

ccgatagcag 
aagccacgga 
atgttccttg 



ggccttccag 
cggagaccgg 
ggccgcctcg 
ggagccctgg 
gccgtgccgc 
ctccgggcac 
gtgaactcga 
ccgttgcaca 
ttg 



ggactgctct 
ccgcgcgcct 
gacgacgacg 
gccggctgca 
cccggcttcg 
ggcttcccag 
agcccacaat 
aaaaggaaag 



<210> 138 

<211> 3451 

<212> DNA 

<213> Homo sapien 



<400> 
cccgggttca 
ccaccacacc 
aggatctttt 
tgaaccatat 
tggcacaacc 
ttgggtttac 
gtgagtgatg 
cacacaactt 
tccaaagggt 
gacagcgagt 
atgtttgaaa 
gctcgggtga 
atgcctccaa 
ggtgcttctg 
gatggcttct 
tcagtgtcta 
cctgtatgtg 
tgtgctgccc 
gccacagaag 
cctctggggc 
gggccccgcc 
gagtgcctcg 
ggcgagggca 
cttcttcaga 
gagtgtgcag 
cgtccgctcg 
gtgcgcttac 
ctggcaccca 
tgggtggtgg 
gtgctggtca 
atcacttcac 
gcccatattc 
attgatgtgc 
tactttgcca 
gtgatcattc 
cactggtacc 
gacttcttca 
ctggtgatca 
aagattgaca 



138 
agagattctc 
tggctaattt 
gctttctata 
gaatttgcca 
cgaaagatta 
tgaagaaatt 
ccacactctc 
tctctctctg 
aaaactgtct 
gtgctgagga 
gggaatacac 
ggcaagttcg 
gccagcgtgt 
tgtttgtttc 
gtgtgagagc 
tggctgtggt 
cgtgcctcgg 
tctgccttga 
agccaaaagc 
cagccagggt 
cgggccagcc 
ccccttcgcg 
gttcccctaa 
gcacgggctg 
gacgagtccc 
cggcccgcag 
ctgccagact 
atagaagcca 
gcatgggcat 
tcacagccat 
tggcctgtgc 
ttatgaaaat 
tgtgcgtcac 
ttacttcacc 
tgatggtgtg 
gggccaccca 
cgaaccaagc 
tggtcttcgt 
aatctgaggg 



ctgtctcagc 

ttgtattttt 

gcttcaaaat 

ttttggtaag 

acaaactatc 

gttztgaattc 

aagagttaaa 

tcccaaaata 

tcatgcctgc 

aatcggcagc 

tgggttaccg 

gagtacccag 

gtttactttc 

tggccgcgtt 

gcgcgcgagt 

tcggtataag 

tgggcaqtct 

gacctcaagc 

tcccgggttg 

agccgggaag 

ccaggagaag 

gctgccggcg 

agtcctgtgc 

gaactggcag 

caccacaccc 

agccccgccg 

gcgcgccatg 

tgcgccggac 

cgtcatgtct 

tgccaagttc 

tgatctggtc 

gtggactttt 

ggccagcatt 

tttcaagtac 

gattgtgtca 

ccaggaagcc 

ctatgccatt 

ctactccagg 

ccgcttccat 



ctcccgagta 

agtagagaca 

gttcttaatg 

tcacagacgc 

cagcagatga 

tcattgcatc 

aacaaaacaa 

catacttgca 

aaattcctaa 

tgttgaagtc 

ggtgtatgtt 

atggagacat 

tgtgtgtgtc 

tctgtgttgg 

gtgcatgtcg 

tctgagcatg 

cgtttccttc 

cgcgcaggcg 

gctggtaagg 

cagtggtggc 

gagggcgagg 

tgccattggc 

acataacggg 

gcaccgcgag 

acaccacagc 

tgggtccgcc 

gggcaacccg 

cacgacgtca 

ctcatcgtcc 

gagcgtctgc 

atgggcctgg 

ggcaacttct 

gagaccctgt 

cagagcctgc 

ggccttacct 

atcaactgct 

gcctcttcca 

gtctttcagg 

gtccagaacc 



gctgggacta 

agagttacac 

ttaagacatt 

cagatggtgg 

aaggattttt 

tccagttcaa 

caaaaaaatt 

tacccccgct 

ggagggcacc 

acctcctgtg 

gggaggggag 

ccgtgtctgt 

accatgtctt 

acaggggtga 

gtgagctggg 

tctgccaggg 

cgaatgtggg 

cccagggcag 

acaccacctc 

ccgccctcca 

ggaggggagg 

cgaaagttcc 

cagaacgcac 

cccctagcac 

cgctgaatga 

cgctgaggcg 

ggaacggcag 

cgcagcaaag 

tggccatcgt 

agacggtcac 

cagtggtgcc 

ggtgcgagtt 

gcgtgatcgc 

tgaccaagaa 

ccttcttgcc 

atgccaatga 

tcgtgtcctt 

aggccaaaag 

ttagccaggt 



caggtacgtg 
catattggcc 
cttaatactc 
caatttcaca 
tttagtttca 
cagataatga 
aaaacaaaag 
ccagataaaa 
taaagcactt 
ctcttgccaa 
cattatcagt 
gtcgctctgg 
tgtgcttctg 
ctttgtgccg 
agggtgtgtc 
tgtatttgtg 
gcagtgccgg 
gcaggtagcg 
cagctttagc 

gggagcagtt 

gaaaggggag 
cgtacgtcac 
tgcgaagcgg 
ccgacaagct 
ggcttccagg 
cccccagcca 
cgccttcttg 
ggacgaggtg 
gtttggcaat 
caactacttc 
cttcggggcc 
ttggacttcc 
agtggatcgc 
taaggcccgg 
cattcagatg 
gacctgctgt 
ctacgttccc 
gcagctccag 
ggagcaggat 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1723 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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gggcggacgg ggcatggact ccgcagatct tccaagttct gcttgaagga gcacaaagcc 
ctcaagacgt taggcatcat catgggcact ttcaccctct gctggctgcc cttcttcatc 
gttaacattg tgcatgtgat ccaggataac ctcatccgta aggaagttta catcctccta 
aattggatag gctatgtcaa ttctggtttc aatcccctta tctactgccg gagcccagat 
ttcaggattg ccttccagga gcttctgtgc ctgcgcaggt cttctttgaa ggcctatggg 
aatggctact ccagcaacgg caacacaggg gagcagagtg gatatcacgt ggaacaggag 
aaagaaaata aactgctgtg tgaagacctc ccaggcacgg aagactttgt gggccatcaa 
ggtactgtgc ctagcgataa cattgattca caagggagga attgtagtac aaatgactca 
ctgctgtaaa gcagtttttc tacttttaaa gacccccccc cccccaacag aacactaaac 
agactattta acttgagggt aataaactta gaataaaatt gtaaaaattg tatagagata 
tgcagaagga agggcatcct tctgcctttt ttattttttt aagctgtaaa aagagagaaa 
acttatttga gtgattattt gttatttgta cagttcagtt cctctttgca tggaatttgt 
aagtttatgt ctaaagagct ttagtcctag aggacctgag tctgctatat tttcatgact 
tttccatgta tctacctcac tattcaagta ttaggggtaa tatattgctg ctggtaattt 
gtatctgaag gagattttcc ttcctacacc cttggacttg aggattttga gtatctcgga 
cctttcagct gtgaacatgg actcttcccc cactcctctt atttgctcac acggggtatt 
ttaggcaggg atttgaggag cagcttcagt tgttttcccg agcaaaggtc taaagtttac 
agtaaataaa atgtttgacc atgccttcat tgcacctgtt tgtccaaaac cccttgactg 
gagtgctgtt gcctccccca ctggaaaccg c 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3451 



<210> 139 

<211> 2305 

<212> DNA 

<213> Homo sapien 

<400> 139 

gaattcatgc cgcgtttctg tgttggacag gggtgacttt gtgccggatg gcttctgtgt 60 

gagagcgcgc gcgagtgtgc atgtcggtga gctgggaggg tgtgtctcag tgtctatggc 120 

tgtggttcgg tataagtcta agcatgtctg ccagggtgta tttgtgcctg tatgtgcgtg 180 

cctcggtggg cactctcgtt tccttccgaa tgtggggcag tgccggtgtg ctgccctctg 24 0 

ccttgagacc tcaagccgcg caggcgccca gggcaggcag gtagcggcca cagaagagcc 300 

aaaagctccc gggttggctg gtaagcacac cacctccagc tttagccctc tggggccagc 360 

cagggtagcc gggaagcagt ggtggcccgc cctccaggga gcagttgggc cccgcccggg 42 0 

ccagcctcag gagaaggagg gcgaggggag gggagggaaa ggggaggagt gcctcgcccc 4 80 

ttcgcggctg ccggcgtgcc attggccgaa agttcccgta cgtcacggcg agggcagttc 54 0 

ccctaaagtc ctgtgcacat aacgggcaga acgcactgcg aagcggcttc ttcagagcac 60 0 

gggctggaac tggcaggcac cgcgagcccc tagcacccga caagctgagt gtgcaggacg 66 0 

agtccccacc acacccacac cacagccgct gaatgaggct tccaggcgtc cgctcgcggc 72 0 

ccgcagagcc ccgccgtggg tccgcctgct gaggcgcccc cagccagtgc gcttacctgc 780 

cagactgcgc gccatggggc aacccgggaa cggcagcgcc ttcttgctgg cacccaatag 84 0 

aagccatgcg ccggaccacg acgtcacgca gcaaagggac gaggtgtggg tggtgggcat 90 0 

gggcatcgtc atgtctctca tcgtcctggc catcgtgttt ggcaatgtgc tggtcatcac 96 0 

agccattgcc aagttcgagc gtctgcagac ggtcaccaac tacttcatca cttcactggc 102 0 

ctgtgctgat ctggtcatgg gcctggcagt ggtgcccttt ggggccgccc atattcttat 1080 

gaaaatgtgg acttttggca acttctggtg cgagttttgg acttccattg atgtgctgtg 1140 

cgtcacggcc agcattgaga ccctgtgcgt gatcgcagtg gatcgctact ttgccattac 1200 

ttcacctttc aagtaccaga gcctgctgac caagaataag gcccgggtga tcattctgat 126 0 

ggtgtggatt gtgtcaggcc ttacctcctt cttgcccatt cagatgcact ggtaccgggc 1320 

cacccaccag gaagccatca actgctatgc caatgagacc tgctgtgact tcttcacgaa 1380 

ccaagcctat gccattgcct cttccatcgt gtccttctac grtcccctgg tgatcatggt 1440 

cttcgtctac tccagggtct ttcaggaggc caaaaggcag ctccagaaga ttgacaaatc 1500 

tgagggccgc ttccatgtcc agaaccttag ccaggtggag caggatgggc ggacggggca 1560 

tggactccgc agatcttcca agttctgctt gaaggagcac aaagccctca agacgttagg 1620 

catcatcatg ggcactttca ccctctgctg gctgcccttc ttcatcgtta acattgtgca 1680 

tgtgatccag gataacctca tccgtaagga agtttacatc ctcctaaatt ggataggcta 174 0 
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tgtcaattct 
ccaggagctt 
caacggcaac 
gctgtgtgaa 
cgataacatt 
tttttctact 
gggtaataaa 
ccttctgcct 
tttgttattt 
gctttagtcc 



ggtttcaatc 
ctgtgcctgc 
acaggggagc 
gacctcccag 
gattcacaag 
tttaaagacc 
cttagaataa 
Cttttatttt 
gtacagttca 
tagaggaccc 



cccttatcta 
gcaggtcttc 
agagtggata 
gcacggaaga 
ggaggaattg 
ccccccccca 
aattgtaaaa 
tttaagctgt 
gttcctcttt 
gagtc 



ccgccggagc 
tttgaaggcc 
ccacgtggaa 
ctctgtgggc 
tagtacaaat 
acagaacact 
ttgtatagag 
aaaaagagag 
gcatggaatt 



ccagatttca 
tatgggaatg 
caggagaaag 
catcaaggta 
gactcactgc 
aaacagacta 
atatgcagaa 
aaaacttatt 
tgtaagttta 



ggattgcctt 
gctactccag 
aaaataaact 
ctgtgcctag 
ngtaaagcag 
tttaacttga 
ggaagggcat 
tgagtgatta 
tgtctaaaga 



<210> 140 

<211> 4105 

<212> DNA 

<213> Homo sapien 



<400> 
gaattcgcgg 
agacggggag 
cgcgcgcgcg 
catccatccc 
gcagtgccca 
cggagcccgg 
tggcccaggc 
aaaaagtgaa 
caagatggaa 
cagtgataat 
ttttcgtggc 
acccagcaat 
caaatacatt 
aactctgctc 
agccagtctt 
taagctgctg 
gctgttcccc 
tagtgttgac 
tttggtaact 
cgaagcgatt 
tatgctcaat 
cttcgggttc 
ttgtgagatg 
gcagcgtcga 
gttccctctt 
ccgatgtgaa 
catgaattca 
tttccagtca 
catgaacgga 
gagcagccat 
aatcctctcg 
cttcttcctt 
ctggtttatc 
ttctgcgtgt 
aaatgaaacc 
ctagctttta 
agattaacga 
atatgggctc 



140 
ccgcctcttg 
gacagactgg 
tacagtcatc 
acccggtcgt 
ggaagttttc 
gacaccggcc 
gccgcgcgga 
ggtgtaaaag 
accctttgcc 
cctgagagat 
acagagctca 
ggctcaatgc 
aacactgtga 
aggatcattt 
gcccttggag 
gctgggcgct: 
tttttgcaga 
aggtacagag 
gccattgaaa 
ggcttcgtca 
gccacatcaa 
tatttctgta 
ttgaacagaa 
gaagtggcaa 
cacttaagcc 
ttacttagtt 
tgtataaacc 
tgcctctgct 
acaagcatcc 
aaggacagca 
gagaaaaaaa 
aattcactcc 
cacccacaac 
tgtattcagc 
agaaggatat 
tggcagtcct: 
caagattttc 
aggtcactct 



cggtcccaga 
aggcgtgttc 
ccgctggtct 
cgccggggat 
tgaagccggg 
accctccgcg 
cccggcagct 
cagcacaagt 
tcagggcatc 
acagcacaaa 
gcttcctggt 
acaactattg 
tatcttgtac 
accagaacaa 
accttatcta 
ggccttttga 
agtcctcggt 
cagttgcctc 
ttgtctccat 
tggtaccctt 
aattcatgga 
tgcccttggt 
nnaatggcag 
aaacagtttt 
gtatattgaa 
tcttactgct 
ccatagctct 
gctgctgtta 
agtggaagaa 
tgaactgacc 
ccacaaggca 
cacacccaag 
atctacgaat 
actaaaaaat 
ttactacttt 
ggtgaatgtt 
tacttttttt 
tatttgaaat 



gtggagtgga 
ctccggagtt 
gacgattgtg 
tggggtccca 
gaagctgtgc 
ccacccaccc 
gtctgcgcac 
gcaataagag 
cttttggctg 
tctaagcaat 
taccactcat 
cccacagcag 
tattttcatc 
atgtatgagg 
tgtggtcatt 
tcacaatgac 
ggggatcacc 
ctggagtcgt 
ctggatcctg 
tgaatatagg 
gttctaccaa 
gtgcactgcg 
cttgagaatt 
ctgcttggct 
gaaaactgtg 
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